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(54) Title: DIAGNOSIS OF DISEASE STATE USING MRNA PROFILES 
(57) Abstract 

Disclosed are diagnostic techniques for the detection of a human diseased state. Genetic probes and methods useful in monitoring 
the progression and diagnosis of the disease state are described. The invention relates particularly to probes and methods for evaluating the 
presence of RNA species that are differentially expressed in ^ individuals with the disease state compared to normal 

healthy individuals. Further disclosed is a multivariate diagnostic model for prostate cancer in a population of men with moderately elevated 
total serum PS A (> 2iQng/ml). Results of quantitative serum assays for the UC325 gene product [Interleukin-8 (IL-8)], total prostate 
specific antigen (*-PSA), as well as Free/Totel (fit fSA) ratios were combined to enhance the sensitivity of prostate cancer diagnosis in 
a denned urologic population diagnosed either ojrganrconflned prostate" cancer (clinical stage A & B), non^rgan^nnned prostate cancer 
(clinical stage C or D) or benign prostatic hyperplasia (BPk). The additional abiljr> of UC325 gene product serum levels to accurately 
stage prostate cancer independently of t-PSA of ^t PSA is disclos 
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(54) Titie: DIAGNOSIS OF DISEASE STATE USING MRNA PROFILES 
(57) Abstract 

Disclosed are diagnostic techniques for the detection of a human diseased state. Genetic probes and methods useful in monitoring 
the progression and diagnosis of the disease state are described. The invention relates particularly to probes and methods for evaluating the 
presence of RNA species that are different with the disease state compared to normal 

healthy individuals. Further disclosed' is a multivariate diagnostic model for prostate cancer in a population of men with moderately elevated 
tp^ serum PS*^ 2.0 ng/ml); Results of quantitative serum assay? for the UG^25 gene product Imterteukin-r8 (IL-8)), total prostate 
specific antigen 0-iPSA), as well as Fre^s/T^ the sensitivity of prostate cancer diagnosis in 

a defined umlogic population diagnosed either organ-confined prostate cancer (clinical stage A & B), non^rgan-confined prostate cancer 
(clinical stageCor-D) or benign prostatic hyper^&ia (BPH), The additional ability of UC325 gene product serum levels to accurately 
stage prostate can^r i 
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DESCRIPTION 

DIAGNOSIS OF DISEASE STATE USING MRNA PROFILES 
BACKGROUND OF THE INVENTION 

• 5' " v : ' ;r --- : -. • ■■■ ' ■ . , * : : : ■ 

A. Field of the Invention 
The present invention relates generally to the detection and diagnosis of human disease 

states and methods relating thereto: More particularly^ the present invention concerns probes and 
methods useful in diagnosing, identify ing and monitpring the progression of disease states through 
10 measurements of gene products. 

B. Descriptibri of the Related Art 

Genetic detection of human disease states is a rapidly developing field (Taparowsky et aL, 
1982; Siamon et aL, 1989; Sidransky et al y 1992; Miki etal 9 1994; Dong et al y 1995; Morahane/ 
15 al 9 1996; Lifton, 1996; Barinaga, 1996). One advantage presented by this field is that certain 
disease states may be detected by non-invasive means, e,g. sampling periph^ amniotic 
fluid. Affected individuals may be diagnosed early in disease progression; allowing more effective 
pMeht management with better clinical out^ 
: Some problems exist with this approach. v A number of knownog^iietic lesions merely 

2Q predispose to development of specific Asease ^te$. v Indw may 
-mot develop the disease state, while other individuals m^y deyelpp the disease state without 
possessing a particular genetic lesion. In human cancers,, genetip de^t^may potentially occur in a 
large number of known tumor suppressergenesandproto^ncpgCTtes. , s ; 

Tlie genetic detection of cancer has a long history. One o^ the earliest genetic lesions 
25. shown to predispose tp cancer was transforming point m^ the ray oncogenes (Taparowsky 

* et al 9 1982). Transforming fas point mutations may be detected in the stppl of individuals with 
benign and malignant colorectal tumors (Sidransky et ql 9 1992), However, only 50% of such 
tumorsjpontained a ras mutation (Sidransky et aL, 1 992). Similar results have been obtained with 
amplification of HER-2/new in breast and ovarian cancer (Slamon et al 9 198$), deletion and 
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mutation of p53 in bladder cancer (Sidransky et al, 1991), deletion of DCC in colorectal cancer 
(Fearon et al , 1 990) and mutation of BRCAJ in breast and ovarian cancer (Miki et al. , 1 994). 

None of these genetic lesions are capable of predicting a majority of individuals with cancer 
and most require direct sampling of a suspected tumor, making screening difficult. Further, none of 
the markers described above are capable of distinguishing between metastatic and non-metastatic 
forms of cancer. In effective management of cancer patients, identification of those individuals 
whose tumors have already metastasized or are likely to metastasize is critical. Because metastatic 
cancer kills 560,000 people in the US each year (ACS home page), identification of markers for 
metastatic cancer, such as metastatic prostate and breast cancer, would be an important advance. 

A particular problem in cancer detection and diagnosis occurs with prostate cancer. 
Carcinoma of the prostate (PCA) is the second-most frequent cause of male cancer-related death in 
the United States (Boring, 1993). The incidence of prostate cancer increased by 50% between 1980 
and 1990 (Stone et al, 1994). Although relatively few prostate tumors progress to clinical 
significance during the lifetime of the patient, those which are progressive in nature are likely to 
have metastasized by the time of detection. Survival rates for individuals with metastatic prostate 
cancer are quite low. Between these extremes are patients with prostate tumors that will 
metastasize but have not yet done so, for whom surgical prostate removal is curative. 
Determinationof Ayhich group a patient falls within is,criucal m detenmning optimal treatment and 
patient survival. 

Genetic changes reported to be associated with prostate cancer include: allelic loss (Bova, et 
al, 1993; Macoska era/., 1994; Carter e/fl/„ 1990); DNAhypermethylation (Isaacs et a/., 1994); 
point mutationsor deletions of the retinoblastoma (Rb) ana P 53 genes (Bookstein e/ ^/., 1990a; 
Bookstein */ a/., 1990b; Isaacs et at, 1991); and aneuploidy and aneusomy of chromdsontes 
detected by fluorescence in situ hybridization (FISH) (Macoska et ai, 1994;Yisakorpi«a/., 1994; 

Takahashi et al , 1 994; Alcaraz e/ al, 1994). 

A recentdevelopmentm this field was the identification of a prostate metastasis si^presser 

; e^ y ^ 

caused a significant decrease in metastatic tumor formation (Dong et al., 1995). However, 
detection of 1M1 mutations is dependent upon direct sampling of mutant, ptostate c^tts. Thus, 
athera primary prostate tumor must be sampled or else sufficienttransformed cells ihust be present 
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in blood, lymph nodes or other tissues to detect the iriissihg or abnormal gene. Further, the 

presence of a deleted gene may frequently be masked by large n 

may be present in a given tissue sample. 

The most commonly utilized current tests for prostate cancer are digital rectal examination 

5 (DKE) and analysis of serum prostate specific antigen (PSA). Although PSA has been widely used 

as ^clinical marker of prostate cancer since 1988^^^ 

utilizing PSA alone or in combination with digital rectal examination have not been successful in 
improving the survival rate for men with prostate cancer (Piirtiii & Oesterling, 1 994). While PSA is 
- specific to prostate tissue, it is produced by normal and benign as well as malignant prostatic 
1 0 epithelium, resulting in a high false-positive rate for prostate cancer detection (Partin & Oesterling, 

"^994): : ; ; - 

Other markers that have been used for prostate cancer detection include prostatic acid 
phosphatase (PAP) and prostate secreted protein (RSP). PAP is secreted by prostate cells under 
hormonal control (Brawn et ql y 1 996). It has less specificity and sensitivity than does PSA. As a 
15 result, it is used much less now, although PAP may still have some applications for monitoring 
metastatic patients that have failed primary treatments. In general, PSP is a more sensitive 
biomarker than PAP, but is not as sensitive as PSA (Huang et aL y 1993). Like PSA, PSP levels are 
frequently elevated in patira 

Another serum marker associated; with prostate disease is jprbstate specific membrane 
20 antigen (PSMA>^^^^ 1 987; Carter <?/t*i£, 1996; Mii^y e/ aA, I99^ PSMA is a 

Type H cell membrane protein and has been identified as IMic Acid Hydrolase (FAli) (Heston, 
i^96 9 Garict et ai 9 1996). Antibodies against PSMA react with <both normal prdstate tissue and 
prostate cancer tissue (Horoszewicz et ah 9 1987). Murphy et ah (1995) used ELISA to detefet 
serum PSMA in advanced prostate cancer. As a serum test, PSMA levels are a relatively poor 
25 indicator of prostate cancer. Hc^y^, ^ PSMA 
is expressed in metastatic prostate tumor capillary beds (Silver et al 9 1997) and is reported to be 
more abundant in the blood of metastatic ^ PSMA 
messenger RNA (rti^ ^ 
after exposure to 5-a-dihydroxytestosterone (DHT> (Israeli €f 4/ , 
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d,a g „ 0 s,s.buuh C u^ u , n e S sofft isnlaAeris ^ teageval ^, P ^ « 

I WJ). K I -|'{ K anahsis was used by di Celle e/ a/ /100^ ^ 

u . «». vwie e/ or/. (1994) to demonstrate IL-8 production in R. 

cell chrome lymphocytic leukemia. Vinante et al w ^ ,, : P auctaonmB- 

upregula.ion of II ft ,v • • } Northem «** analysis to~show 

P gulai.cn „f „.- 8 expression m acute myeJogenous ^ 

increase in serum levels of II ft in k ( ' OUnd 311 
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invention is that a disease state may be detected, diagnosed, or a prdgribsis may be derived by 
examining a blood sample rather than relying on a more invasive, or less sensitive test. In 
addition, a subject may be monitored for disease ptogfessioii, stetiis iand response to therapies 
through monitoring of differentially expressed disease marker^. In certain embodiments of ther 
5 present invention st "patient*' "individual" or "subject" may life an animal, including a laboratory 
animal or other aniiriakSpecies, or in certain embodiments a human subject. 

In certain embodiments of the invention the terms "expression", "gene expression** and 
"expression products*' may refer to either production of a riiaorker gehe RNA message or the RNA 
message produced or both. In certain bther embodiments of the invention the terms 

10 "expression", "gene expression" and "expression products" may refer to either translation of a 
marker RNA message "into proteins, polypeptides arid/of peptides, arid/or to the produced 
proteins, polypeptides, and/or peptides. Ih cettain aspects of Ihe invention a marker may be a 
gene whose expression is activated to a higher level in a patietit Mffenng a disease state, relative 
to its expression in a healthy subject. It is also understood that a dtfferentialiy expressed marker 

15 may be eit&er activated or inhibited at the nuclei^ aieid level ' ot prbtein level, or may it may, 
subject to alternative splicing to result in a diifferent polypeptide product. Such differences may 
be evidenced by a Change in itfRNA levels* surface expression, secretion or other partitioiiing of 
a pbl J^6pttde, for exaniiple. In certain aspect of the invention; ii marker ihay be a comparison of 
expresSioh between two br more marker geries, atid/bi: a cc5^ari$6ii of the iatibs of the 

20 egression "between or more marker genes, or evfen a comp^sdh ^ of tWeF difieffently '* 
processed products bf the same gene; which diffef between healthy ~ subjects and subjects 
Suif^^ * 

As demonstrated in the ex^ples includdd Bfeteiuj the present inVentdrs have identified 
cemM mafe liavb been applied for the detection of 

25 v W 

^ies miy be detected and riionitbiNE^ by SttrVeyihg'the re^bhse bf healthy iUtoititie delfe to the 
disease condition. As such, this novfel tti^od ^ of 
inarke^ liiat aire differentially expressed fii respHji^e ttf o 
^\^'a^^^'$^k^-^[ asthmd/ lupus er^eiti^^ 
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gravis, autoimmune thyroiditis, amyloid lateral sclerosis, interstitial cystitis, prostatitis or other 
systemic or chronic conditions. 

In a certain embodiment of the present invention, the inventors have demonstrated the 
ability to detect and discriminate between benign prostatic hyperplasia (BPH) and prostate cancer, 

5 using multivariate analysis with several different prostate disease markers. By combining test 
results for serum prostate specific antigen (PS A) and IL-8 gene products* it is possible to identify a 
significant proportion of individuals with, prostate cancer, while achieving close to one hundred 
percent accuracy in differentiatingbet>yeen individuals presenting with prostate cancer versus BPH. 
These levels of sensitivity and specificity represent .-significant advances over the prior art in 

10 prostata cancer detection and differentiation, which traditionally have been performed with 
univariate analysis with PSA, digital rectal examination and other techniques. It is further disclosed 
that levels of IL-8 gene product in the peripheral circulation may be used to discriminate advanced 
from localized stages of prostate cancer. 

It is an important aspect of the present invention that it is the response pf the normal 

15 blood lymphocytes that is being examined, rather than the prostate, breast or other disease cells 
themselves as in previous methods. As an aspect of the invention, certain mRNAs are identified 
that are differentially (expressed in normal cells, as a reaction to a disease state, relative to their 
expression in healthy subjects. Two of the metastatic cancer-markers disclosed herein represent 
previously unreported genes, with one of the two matching a small expressed s^uence t^ (EST) 

20 described in Genebank Accession # T03 01 3 and SI2Q ID NO: 1 , and another na^tchingthe ^uen6e 
disclose4 in SEQ IE) NO:2. ^o^er marker 1* 
alpha (Genebank Accession # X03558 and SEQ ID NO:3). Two other ^^ke^ rg>resent 
alten^ Genebank 
Ac<^io^ ^ Y00787 and SEQ ID r NO:4) of mRNA from the 11^8 (in^rle«kin 8) gene. One 

25 meta^^c cancer marker is ^previously uncharacterized gene (SEQ ID MQ:29) that J&s jiOTciQlogy 
. to a num^r r pf previously identified EST ^uenc^ wMle ^p&er m 

g^e sequence<KAG00262 K ,j ^ 

, q?SA); ;proste^ met^brane ^g<m (PS^^ 

30 phosphatase (PAP); prostatic secretory proteins (PSP^); human kallekrein 2 (HK2) ; and the ratio 
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of the concentrations of free and bound forms of PSA (f/t PSA), in combination with ahy of the 
markers identified herein as SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:4, SEQ ID 
NQ:2 or SEQ ID UO:29 y and the sequences identified in Geiiebank Accession its D8745 1 , T03013, 
X03558, M28 130, and Y00787, their complementary nucleic acid sequences, or their expression 
5 products may be used in all embodiments using or detecting the markers in any of the methods 
^disciosed herein or known in the art. In the examples disclosed herein, tfie differehtial expression 
of marker genes is detected by RNA fingerprinting methods, however, differential expression 
detected by Shy other means, including but hot limited to other RNA fingerprinting methods, 
Northern blotting, ^ iirbiiunodetection, piotein-prbtein interactions, biological activity or other 
10 methods kno wn in the art woulcl fall within the Scope of the present invention. j 

The preserif disclosure is the first report of ah alternatively spliced form of IL-8 mENAthai 
includes intfon 3. In the peripheral blood of nohxial individuals the mRN A transcript containing 
iiitron 3 (Genebank Accession # M28 130) is moie abundant than the previously reported spliced 
form from which nitron 3 is missing (Genebank Accession # Y00787). Surprisingly, in patients 
15 with metastatic prostate cancer the previously reported spliced form is ihuch more abundant, with a 
seven-fold increase compared to normal individuals. In contrast, the transcript containing intron 3 
is approximately seven-fold less abundant in patients with metastatic prostate cancer than in normal 
'■- : '. 'individvials.''" \ .'/.- '" /:' 

The ^iibst^ti^ change in levels of alternatively ^ 
20 blood of individuals With metastatic cancer provides a simple and effective diagnostic test for the 
presence of cancer metastases, that is unaffected by , problems in sampling priihaiy tumors or the 
Masking ihfluendi^f nortoiSI cells in a tissue sample* It therefore represents a significant advance 
over previous methods for detecting and diagnosing metastatic cancer in humans. The skilled 
practitioner Mil realize that metastatic cancer ' detection and diagnosis may be performed by 
25 quantitative aiialysis of eitlWthe EL-8 mKNA transcripts themseto^ products. > 

The present disclosure represents a sjibstantiM ISiii ^e&pec*^ 
kftbwleage in this field/ It i^p^rts a novel Spliced form of It-8 mRNA that is repressed in 
'iSietastatic pfosttdite cancer. It ptovides a sensitive meattfe for detecting metastatic cancer by 
^ ine^sMrig {he levels of the two alternatively spliced IL-^ AftNA forms/ It pibvides a highly 
30 sensitive and specific inethbd for detecting and differentiating between BPH, localized and 
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adyanced forms of prostate cancer by combining detection of IL-8 gene product with other markers 
of prostate disease. 

The present disclosure further demonstrates the feasibility of detecting and diagnosing 
human disease states in general by monitoring changes in the expression of specific genes in 
5 peripheral lymphocytes. The skilled practitioner of the art will realize that such a technique has 
widespread applicability for screening of asymptomatic individuals for disease state marieeis, 

The identified disease state markers may in turn be used to design specific oligonucleotide 
probes and primers. In certain preferred embodiments the term "primer" as used here includes any 
nucleic acid capable of priming template-dqpendent synthesis of a nascent nucleic acid. In certain 
10 other embodiments the nucleic acid may be able to hybridize a template, but not be extended for 
synthesis of nascent nucleic acid that is complementary to; the template. As used herein a << primer ?> 
may be at least about 5, about 6, about 7, about 8, about 9, about 10, about 1 1, about 12, about 13, 
about 14, about 1 5, about 1 6, about 17, about 1 8, about 1 9, about 20, about 21, about 22, about 23, 
about 24, about 25, about 26, about 27, about 28, about 29, about 30, about 35, about 40, about 50, 
15 aibout, 75, about 100, about 1 50, about 200, about 300, about 400, about 500, to one base shorter in 
length than the template sequence at the 3' end of the primer to allow extension of a nascent nucleic 
acid chain, though the 5* end of the primer may extend in length beyond the 3* end of the template 
sequence. In certain embodiments of the present invention the term "template** may refer to a 
iiueleic acid that is used in the creation of a complementary nucleic acid strand to the "template" 
20 strand ^ The template may be either RNA and/or DN A, and the cbmplement^.strand may also be 
RNA and/or DNA. In certain embodiments the complementary strand may comprise all or part of 
^ it is not an 

,Si X complementary strand to the template. Strands that are not e^ctly complementary to the 

, . - V template strand may hybridizei^ecifically to the template strand in detection assays described here, 
25 as well as other assay s known in the art, and sijch complementary strands that can te nsed in 
detection as^ys;%e part of the invention. ♦ 

amplification prbiSedures, th^ .probes and 
y In cert^ a^cts of the iw 

, ,the tetin "amplification" may refer to any method or technique known in the art or described herein 
for duplicating or increasing the number of copies or amount of a target nucleic acid or its 
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complement, in certain Aspects of the invention, the tettn "amplicon" refers to the target sequence 
for amplification, or that part of a target sequence that is amplified, and/or the amplification 
products of the target sequence being amplified. In certain other embodiments an "aihplicbn" iriay 
include the sequence of probes or primers used in amplification. This ahialysis assists physicians in 
5 ^ detecting arid diagnosing ihe drsease state and in determining bptimaf treatnient boiii^ for 
individuals at varying stages of disease state jj>rogressioil 

In light of the present disclosure, one of ordinary skill in the art will select segments from 
the identified marker genes for use iri the different detection^ diagnostic, or prognostic ihethods, 
wector constructs, antibody production, kiu and/or any of the embodiments described herein as part 

10 of the present inventibn. Marker gene sequences include those published in the Genebank database 
*ftiat mateh the identified marker genes: Genebaftk Abcession tiumlbers D87451, T03013, X03558, 
M28 1 30, and Y00787, as well as the sequences disclosed herein as SEQ ID NO: 1 , SEQ ID NO:2, 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, and SEQ ID N0:29, which also include sequences 
for previously uncharicterized marker geiies (UCPB 35, SEQ ID ftO: l ; UC 302, SEQ ID NO:2; 

15 UC 331, SEQ ID NO:29) identified herein. * For example, in certain embodiments in which one 
may be practicing the present invention for the identificatioii df a disease marker, for example, the 
sequences selected to design probes arid primers may include repetitive stretches of aidenine 
nucleotides (poly-A tails) nonfcajly attached at the ends of the RMA for the identified marker genes. 
In certain other embodiments, probes and primers may be* specifically designed to not include these 

20 or other segments froriri the identified marker genes, as one of ordihaxy skilled in the art m&3f deem 
eertaittsfc^ 

For exai^jple, where a gfenoimc sequence is disclosed, one Would use sequences that 
correspond to exori rfegions of tfa& gfcne in most cases. However, as d^cribed hereihv at least one 
metastatic cancer marker inblud^ alternately spliced transcripts so that Sritironid sequence^ inay be^ 
25 used for diagnostic: 6t prognose pib^ses (Genebahk Accession # M28130). E^ori' i5equen<^ in 
the gene Structure, ^ desdM ifi the Genebank listing for AcfeeSSiotf* M28130,'in^Itide bases 
— j l482 ttf 2^ to 2599, 2fctt to 2954, arid 3370 ta 423& IriSftri 3^taud^*ases 2954 to 
3TfO. Otoe of orrfiffiiary Skill in t& art ibay seleti: segm^dts &<M the pu^likh^d exori sequetices, or 
iri^ assemble iSteMii 

30 ^c^ ^iritron l AhdMively, the priblished s^udrice for IL-8 ^ a splibed fomi firom 



WO 98/24935 FCT/US97/22105 

10 

which intron 3 is missing (Genebank Accession # Y00787) may be used. By choosing or selecting 
the sequences to include or exclude intron 3, one could preferentially detect expression of one 
alternatively spliced form of IL-8, or even the ratio of the two forms using the methods disclosed 
herein. One of ordinary skill in the art may select and/or assemble segments from any of the 
5 identified marker gene sequences into other useful fomis, such as coding segment reconstniction's 
of mRN A sequences from published genomic sequences of the identified marker genes, as part of 
tlie present invention. Such assembled sequences would be useful in designing probes and primers, 
as well as, providing coding segments for protein translation, for detection, diagnosis, and prognosis 
embodiments of the invention described herein. 
10 For example, primers to detect the message of IL 8 using the transcribed portions of the 

marker sequence as set forth in the listing in Genebank Accession # M28130 may hybridize to 
nucleotides 1482 tp 1503 and the cpmplement of nucleotides 1 626- 1647. These p^cularprhners 
would amplify a segment of message of the marker gene 166 base pairs in length. Primeis 
designed to nucleotides 1482 to 1503 aiid the complement of nucleotides 2464 to 2483 would 
15 amplify a segment of message of {he marker gene 186 base pairs long in messages that have the 
intervening intron between nucleotides 1648 to 2463 removed. Thus, one skilled in the art would 
be able to calculate the expected size of transcribed sequences from marker genes identified herein 
t ^vhose sequences are, published either as genomic: sequence^ mJRNA, or cDNA, as well as the 
^qu^nces disclosed herein, taking, into account the differences in size of the products produced 
20 cte|^nd^ absence of intronic sequences. In preferred embodiments, the 

differences in size of amplification products using primers designed to regions flanking both sides 
^ -o£ intron 3 iu the :%»8 marker gen^^s # Y00787 and # 

c , f M2 8 1 30) ? pan ibe used in detection, diagnosis, and/or prpgnpsis of metastatic cancer. However, 
; piimere designed to regions of IL- 8 sequences that do not flank intron 3, or the other marker genes 
25 } : > that jdo npt have difference in intron splicing, or tl^at prime mRNA or cDNA template sequences, 
; , ^o^4#ot ^ intronic segments. 

For ,e 1 to 20 and Ae cxm^^ait of nucleotides 

. : r ^2(^^^0ofS^jp NO:l ^ a metastatic niajker^gene segment 220 b^ pairs long. 

30 , • ; ^ IsFQ ba^ pairs long. Pj^ners designed ^ 
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to nucleotides 102 to 120 and the complement of nucleotides 381 to 401 of the IL-8 marker gene 
sequence identified in Genebank Accession # Y00787 would amplify a metastatic marker gene 
segment 302 base pairs long that wbuld be approximately sevenfold less abundant in normal 
patients wheri compared to patients with metastatic prostate canter. Primefrs can be designed to 
5 Amplify the transcribed portions of the metastatic cancer markers that would include any length of 
nucleotide segment of the transcribed sequences, up to and including the full length of each marker 
gene message. It is preferred that the simplified segmeiits of identified marker geiies be an 
ampiicoh of at least about 50 to about 500 bdse pairs in length. It is particularly preferred that the 
amplified segments of identified marker genes be an ampiicoh of at least about 1 00 to about 415 

1 0 "base pairs in length, and/or no longer in length than the amplified segment used to normalize the 
"quantity of message feeing amplified in the detection aissays described herdnl fci !Such assays include 
RNA fihgerprihting methods, however, differential expression may be detebted by btfcer means, 
arid all siicli methods woulcl fall \vittiiri the scope of the present invention. The predicted size of the 
amplified metastatic cancer marker gene segment, cialculatedby the location of the primers relative 

1 5 to the transcribed sequence, wotild be used to determine if the detected amplification product is 
indeed the marker gene beirig ainplified* Sequencing the amplified or detected band that matches 
the expected si2te of the amplification product and comparison of the band's sequence to the known 
or disclosed se^tiene& of the marker gene would confirm that the correct marker gene is being 
amplified attd detectedo 

N 20 The identified markers may also be used to identify and isolate full leng^ gehe se^ifettces, 

' ^ inbludirig regul&tory elements fot |*ene expression, from genomic hufrian ? DNA libraries. The 
eDNA^eiqtiences identified m 
gehoiiiiVhtim 

identified Ml^eiigth genes may be isolated by ^bhirombsomal walking* (also called "overly 
25 hyltfifetiori H ); ^ u <^maillt & Carbon M Ov«lii> Hybridization Screening: - Isolation and 
*tA^^ of ^^4pt>ini WiA Fragments Surroundirig the LEU2 Gene on Weast 

cDNA hybridization probe, nonrepetitive sejg^eti^^ 

-W^b84ised ^ Hh5rbridi2^6ii pirdbes in further genomic library screening, iiltimately allowing 
30 ' ? i§fcI^6nofi^^ It "wUl be recognised that 
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full length genes may , he obtained using the small expressed sequence tags (ESTs) described in this 
disclosure using technology currently available and described in this disclosure (Sambrook et al y 
1 989; Chinault & Carbon* 1 979). Sequences identified and isolated by such means may be useful 
in the detection pf the prostate marker genes using the detection methods described herein, and are 
5 part of the invention. 

The identified markers may be used to identify and isolate cDNA sequences The EST 
sequences identified in the present disclosure may be used as hybridization probes to screen human 
cDNA.librarie? by conventional techniques. It will be recognized that these techniques would start 
by obtaining a high quality human cDN A library, many of which are readily available from 

10 cornmerciaJ or other sources. The library may be plated on, for example, ag^ose plates containing 
nutrients, antibiotics and other conventional ingredients. Individual colonies may then be 
transferred to nylon or nitrocellulose membranes and the EST probe§ hybridized to complementary 
sequences on the membranes. Hybridization may be detected by radioactive or enzyme-linked tags 
associated with the hybridized probes. Positive colonies may be grown up and sequenced by , for 

15 example, Sanger dideoxynueleotide sequencing or similar methods well known in the art. 
Comparison of cloned cpNA sequences with known human or animal cDNA or genomic 
sequences may be performed using; cpmputpr programs and databases well known in the art. 
Sequences identified and isolated by such means ^inay be useful in the detection of the prostate 
disease, or other disease marker genes using the detection methods described herein, and are part of 

20 the invention. 

In one* 

; ) ^ e^r^ion ^ctpre and ^xpres§ed as thg ^icoded proteins or 

I^ptides v 

y.i::. s^ ue n$$& 4}sclpSG^.h^xcit^ as m^ker genes, • C<^#ng sequences i$ay be assembled from amino 
25 ' \. acid ^ncodiiig segn^ente ^ genes, to xemo ve npneoding segments^ or to truncate coding ^ 

; ssequence^pT to use the cx^^gs^^ in 

: te.u^dtoe:pr^ 

- Such protein? or peptides are in turn ,used , as antigens for induction of monoclonal or 

30 ; . ; ; j^olyclpnal antibody produt^Q^ 
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as additional markers for human disease states. Aiitibody-protein binding may be detected and 
quantitated by a variety of means known in the art, such as labeling with fluorescent or radioactive 
ligarids. 

Certain metastatic marker genes disclosed Herein (SEQ ID NO: 1 and Geriebank accession # :' 
5 T03013; and SEQ ID NO:2) do not havd reading frames for translation disclosed. However, one of 
.°"*'oWinairy skill in the tm may translate the identified sequences or segments thereof in the three 
potential tv4dmjr frames to obtain peptides or proteins for use in generating antibodies to these 
marker j-encv Such antibodies may be used to purify the proteins of the marker genes, and the 
^identity b! protein being detected is confirmed by peptide sequencing of the protein. Once 
1 0 "confirm^ a* binding the translation products of the marker genes corresponding to SEQ ID NO:l 
" r arid Gcriebahk accession # TOSb l 3, and/or SEQ ID Wb:2, the antibodies that bind the marker gene 
protein would ^useful in detecting, diagnosis, or prognosis of metastatic cancer. 

An example of ah marker gene sequence that would be preferred for translation would be 
intron 3 ofHL-8 (Gcncbank Accession # M28136): Peptides or polypeptides that contain amino 
15 acid sequences from thii; intron would bfe preferred in the creation of polyclonal or monoclonal 
antibodies that preferentially detect form 

In certain aspects of the present invention the tenns ^immunodetection^, "immunobinding", 
*^mmiindrea^tion^ "lmmundhi^cfidmcai^, "immunosc^fif \ and "radiourmti^^ 
to methods that concern binding, purifying* removing, quantifying or btherwise genemliy detecting 
20 biolbjgical cohipbnentis by obt&riiiijg a sample suspected of cbntaining a protein, peptide or 
antibb^yt and contacting the sample with an ahtibddy or protein or peptide in accoicdahee With the 
^ pifeis^ mVeriti^^W ijfe cfc^ °f 
:; Ju viii^ c In certain p^refe^ed aspecte^ of Ae present ihS^htion, one obtSitis a sample 

suspected of containing a disease state-marker encoded protein^ pep 
25 ahtitkkly, and boift^i^fhe isarriple with a^^tibody or ehcoded proteiii br peptide* as the case may 
%e* and then detects of quantifies the ambimt of immune complex ^^fo the specific 

- ^iiditions^ The steps of various useful iinmuhodeteetion me^ have bem described in the 

30 ^ specific fd^ disetee ^te marieer^ ^6rnprisiri| 
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acids of a sequence comprising the sequences published in Genebank Accession numbers D87451, 
T03013, X03558, M28130, and Y00787, as well as the sequences disclosed herein as SEQ ID 
NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:29. Such 
probes and primers may be of any length that would specifically hybridize to the identified marker 
gene sequences and may be at least about 14, about 15, about 16, about 17, about 18, about 19, 
about 20, about 21, about 22, about 23, about 24, about 25, about 26, about 27, about 28, about 29, 
about 30, about 35, about 40, about 50, about, 75, about 100, about 150, about 200, about 300, 
about 400, about 500, and in the case of probes, up to the full length of the sequences of the marker 
genes identified herein. Probes may also include additional sequence at their 5' and/or 3* ends so 
that they extend beyond the target sequence with which they hybridize. Such primers may be used 
to amplify disease state markers present in a biological sample, such as peripheral human blood. 
Amplification increases the sensitivity of various known techniques for detecting the presence of 
nucleic acid markers for human disease. Probes that hybridize with nucleic acid markers for human 
disease may be detected by conventional labeling methods, such as binding of fluorescent or 
radioactive ligands. The availability of probes and primers specific for such unique markers 
provides the basis for diagnostic kits identifying disease state progression. 

An embodiment of the present invention encompasses a kit for detecting a disease state in a 
biological sample, comprising pairs of primers for amplifying nucleic acids corresponding to the 
marker genes and containers for each of these primers. In another embodiment, the invention 
encompasses a kit for detecting a disease state in a biological sample, comprising oligonucleotide 
probes that bind with high affinity to markers of the disease state and containers for each of these 
probes. In a further embodiment, the invention encompasses a kit for detecting a disease state in a 
biolo^catsample, comprising antibodies specific for proteins encoded by the nucleic acid markers 
of the disease state identified in the present invention. 

In one broad aspect, the present invention comprises an isolated nucleic acid pf a sequence : ... 
comprising SEQ ID NO: 1 , SEQ IE> NO:3, SEQ ID NO:5, SEQ ID NO:4, SEQ ID NO:2, SEQ ID 
— jf the sequences identified in,, Genebank Accession #s D87451, TiQ30^, X03558, 
M28130,and Y00787. The invention further broadly comprises an isolated nucleic acid of between 
1^ andi 100 bases in length, ; either identical to or cpmplernentaiy with portions, of the above 
aj^^ucleic acids. Such isolated nucleic acids may themselves be used as 
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human disease markers, or may be used to design probes ^nd primers specific for disease state 
' markers. The invention further broadly comprises an isolated nucleic acid of between 17 bases to 
the full sequence length, either identical to or complementary with portions of the above rnentiohed 
isolated nucleic acids. 

In another broad aspect, the present invention comprises proteins and peptides with amino 
*acid sequenced encoded by the aforementioned isolated nucleic acids. The proteins and peptides 
may be directly detected in the practice of the invention, of used for antibody production. 

The invention also broadly comprises methbds for identifying biomarkerfc for use in 
^prbgriostit or diagnostic assays of a disease State, using the technique of RNA fingerprinting to 
'identify KNtAs that are differentially expressed between individuals With the disease state versus 
%6rtiial individuals. In the practice of the method; one ^ may" use random hexainbrs, arbitrarily 
chosen oligonucleotides, promiscuous oligonucleotide primers or anchoring primers, &s well as 
Oligonucleotide primers specific for known gene sequences for the reverse transcription step 
and/or for the amplificktion step. The term K pfomiscuous oligonucleotide primers" as used 
herein deribtes oligonucleotides that are ^ati^cally designed to sainple sequence complexity in 
mRNAs, or open reading frames of mRNAs Without bias as applied in a PCR based RNA 
fihgerprinting technique. The use of promiscuous primers is preferred because such use 
; incrSe^ses tlie sampling rate <^ RJ^ for fingefprihHtt^ by iii^easirig the displayed fingerprint 
complexity. This increases the rate at which differentially expressed mRNAs caii be discovered. 
^ oligonucleotide as disclosed herein will be evident to one of 

'^^^ in the art in light bf th£ |mbl^^ti<>h by arid Nigaria, Nature Biotechnology 

certain ^etttbodiments 

^"fe to ptiaiers of reiudom or 

^^ni^Sbitt huiaebti^e seqtietic£ bf ^bbiir & bas&? ift J^gth; to the 
length of the primers may be of any length previously described for "priiiifefs'\ In beitSih aspects 
^^&e^ aire selected at the 

''•ic^r ^ de^tibed by 

6ligbn^Ifeb^de ^qu^te ^ignfed to 
bind to specific genes, IL-8 or PS A for example, may also be used in the practice of this method. 
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The present invention may be described in a broad aspect as a method for identifying 
serological markers for a human disease state. The method comprises the steps of providing 
human peripheral blood mRNAs; amplifying the mRNAs to provide nucleic acid amplification 
products; separating the nucleic acid amplification products; and identifying those mRNAs that 
5 arc differentially expressed between normal individuals and individuals exhibiting a disease 
state. The described method may also comprise, in certain embodiments, the step of converting 
the RNAs into cDNAs using reverse transcriptase to detect and quantitate circulating cells 
induced b\ ihc disease stale. In certain embodiments of the invention conversion of RNA into 
cDNAs using Reverse transcriptase is referred to as a "reverse transcriptase" reaction. Methods 
10 of reverse transcribing RNA into cDN A are well known and described in Sambrook et al y 1989. 
Alternative methods for reverse transcription utilize thermostable, RNA-dependent DNA 
polymerases. These methods are described in WO 90/07641, filed December 21, 1990, 
incorporated herein by reference. In certain other embodiments of the invention a "reverse 
transcriptase** reaction refers to additional steps of amplification of the RNA template or its 
15 cDNA product. Such step of amplification may include any methods known in the art of 
increasing the number of copies of RNA or DNA, as well as the methods described herein. 
Methods; of amplification include the methods described in Davey e t aL 9 EPA No. 329 822 
(incorporated herein by reference in its entirely) :, as well as polymerase chain reaction or ligase 
chain reaction ,-. . . 

20 The meAod described in the previous paragraph may b^ to discover disease markers, 

; for any disease state tliat affects the peripheral blood lymphocytes. Such diseases include, but 
;. ^£*e jpH^Jii^ted to metastatic or organ defined cancer* parjicularjjf metastatic prostate or breast 
; v :C£incer$ > asthma^ lupus , erythemat<^si s, rheiunatoid arthritis, multiple sclerosis,, inyastheni^ gravis* 
, autoimmune, thyroid^^ amyotrophic lateral sclerosis (ALS or Lou Gehrig's disuse), ia^rstitial 
25 / ; . _ , : : - ; w . A:V .. ■ 

/ ; ; s^p^ 

^ : j^eleeted to bind specifically tQ -#n iscd[atec| nucleic acid of a sequence com|?m SEQ JQ NO:l, 
i ■^^■m^^^B^T^ N0;5,SEQ IDN^ 
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"fdentified in Genebarik Accession #s 1)8745 1 , T030 1 S, X0355 8, M28 1 3 0, and Y00787, thereby 
measuring the amounts of nucleic acid amplification products formed. 

The invention further broiadly comprises the pjr6gno$is and/or diagnosis of a disease state by 
measuring thje amounts of nucleic acid amplification products formed. The amounts of nucleic 
5 amplification products identified in an individiial patient may be compared with groups of normal 
individuals or in&iVidiiSls with an identified disease state. Diagnosis may be accomplished by 
finding that the patient's levels of disease state markers fall within the normal range, or within the 
range observed in individuals with the disease state. Further comparison with groups of individuals 
Wf Varying disease state progression, subh as metastatic vs. non-metastatic cancer, may provide a 

10 prognosis for the individual patient The inventidh further broadly comprises kits for performing 
the above-mentioned procedures, coiitaining ^plificatiori primers and/or hybridizationprobes. 

The invention may be described therefore, in ceilaiiri bfoiad aspects as a method of 
detecting a human disease state, comprising die steps of detecting the quantity of a disease 
markdr expressed in human peripheral blood arid comparing the quantity of the said marker to 

15 the quantity expressed in perip>heiial blood 6f a rioritial individual, where a difference in quantity 
of expressibh is indicative of a disease State. In the practice of the method the disease marker 
may preferably be afj j mRNA, or even ah mRNA amplified by an RNA polymerase reaction, for 
example. The mRNA may ated be amplified fey any other means such as reverse transcriptase 
polymerase chain reabtidn or the "H^Se chaiii reaction; The RNA may be detected by My means 

20 todi&ii hi the art, sifch as by RNA fitigefjmfffi a nuclease prdtectiori assay, : 

ifef e^ 

cancer, asthma, lupus erythromatosis, rhe 

* ^o^ 

25 In cettaui pMeired eihb^ 

titfifie sfecj^ences or fee e«mp^ trali^bed sequences disclosed herein ^ ID 

%d^, ^BQ BtJ^O^, SS^ID ND?5, SE0 ID >4£>:4, SEQ EE) >tt©:2, §EQ ID N<*29 atid the 
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The method of detecting a disease state described in the previous paragraphs may further 
comprise the steps of providing primers that selectively amplify the disease state marker, 
amplifying the nucleic acid \yith said primers to form nucleic acid amplification products, 
detecting the nucleic acid amplification products and measuring the amount of the nucleic acid 
5 amplification products formed. In the practice of certain embodiments of the method, the 
primers may be selected to specifically amplify a nucleic acid haying a sequence comprising 
SEQ ID NO:l , SEQ ID NO:3, SEQ ID NO:5^SEQ ID NO:4, SEQ ID NO:2, SEQ ID NO:29 and 
the sequences identified in Genebank Accession #s D87451, TG3Q13, X03558, M28130, and 
Y00787. In certain alternate embodiments, the marker may be ^ polypeptide, and may even be a 
10 polypeptide encoded by a nucleic acid sequence comprising a sequence disclosed herein as SEQ 
ID NO: 1 , SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:4, SEQ ID NO:2, SEQ ID NO:29 and the 
sequences identified in Qenebank Accession #s D87451, T03013, X03558, M2813Q, and Y00787, 
, or it may be described in certain embodiments as a polypeptide encoded by the IL-8 gene. 
Detection of the disease state may be by detection of an antibody immunoreactive with said 
15 marker. It is also an embodiment of the invention that detection may be by a cellular bioassay, 
that responds to the presence of a biologically active agent such as IL-8, for example* In certain 
embodiments of the present invention a "bioassay" is any assay that measures pr detects the 
presence of v a compound or effector, such as a protein, polypeptide, or peptide product of an 
expressed marker gene, by its affect on a cell, organism, or biologically derived reagent or 
20 detection system- Bioassays that t$ay be u^d in the present invention, include, but are not 
, ; Ipq^^ to, those described in, Schroder <?/., 1990 and Yoshimura et al t9 1989, Ku^pwska et al., 
1^ t^O,assay5 laiovm in the 

- ^ait tl^ cau4>e u^ed to detect the expresi^ makers. ,' ; •• r 

The present myeptton broadly comprises production of antibodies specific for proteins or 
25 peptides encoded by SEQ ID NQ:U SEQ ID NO:3, SEQ ID NQ:5, $EQ IP NO:4, SEQ ID NO:2, 
SEQJ^ 

■ a ^ State, ^e levels of ^ch bl^wj a patient 

may be quantitated by Qonvehti^n^ ^thod?.; Coirelation of prptei^^^^ of a 
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human disease or the progression of a human disease may be accomplished as described above for 
nucleic acid markers of human disease. 

Another broad aspect of the present invention comprises the detection and diagnosis of 
disease states, including BPH and prostate cancer, by combining measurement of fevels of two of 
5 inore disease state markers. A broad embodiment of the invention comprises combining 
measurement of seruiri IL-8 gene product with other markers of prostate disease, such as PSA, 
PAP, HK2, PSP94 and PSMA. Yet another broad aspect of the present invention comprises kits for 
detection and measurement of the levels of two or more disease state markers in biological samples. 
¥he skilled practitioner will realize that such kits may incorporate a variety of methodologies for 
10 detection and measurement of disease state markers, including but hot limited to oligonucleotide 
probes, primers for nucleic acid amplification, antibodies which bind specifically to protein 
products of disease state matker genes, and other proteins or peptides which bind specifically to 
disease state marker gene products. 

15 BMEF DEiSCRlPTION OF THE P^WINGS 

= \ The following drawings form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The invention may be better pnderstppd by 
reference to one or more of these drawings in combination with the detailed description^ of 
20 specificembod ^ 

u FI0..1A. Relative quantitative RT-PGR of UC Bands #325-1 (intron 3-) and 325-2 

Citron 3+) shpws that the nprmally spliced form of IL-8 mRNA (intron 3-) is abundantly 
expressed in individuals with metastatic prostate cancer (M) compared with normal individuals 
25 &($if). The amplification reactions were sampled at different cycle numbers. The alternatively 

spliced form of IL-8 mRNA f intron 3+) is more abundaht in normal individuals than in patients 
; with metastatic cancer. The data were normalized againstjB-actin mRNA. 

FIG. IB. Relative quantitati ve RT-PCR of UC Bands #325-1 (intron 3-) and 325-2 
30 (intron 3+) shows that the normally spliced form of IL-8 mRNA (intron 3-) is abundantly 
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expressed in individuals with metastatic prostate cancer (M) compared with a pool of normal 
individuals (N). The alternatively spliced form of IL-8 mRNA (intron 3+) is more abundant in 
normal individuals than in patients with metastatic cancer. The data were normalized against B- 
actin mRNA. 

FIG. 2. Ability of total PSA (t-PSA) to distinguish BPH and Stages A, B, & C prostate 

cancer. 

FIG. 3. Ability of corrected free/total PSA (ft PSA) ratio to distinguish BPH and Stages 

A, B, & C prostate cancer. 

FIG. 4. Ability of TJC325 (IL-8) to distinguish BPH and Stages A, B, & C prostate 

cancer. 

FIG. 5. Abihty:ofUe325 (jL*8) and t-PSA combined to distinguish BPH and Stages A, 

B, & C prostate cancer. 

FIG; 6; Ability of UC325 (IL-8) and'the f/t PSA ratio combined to distinguish BPH and 
Stages A, B, &; C prostate cancer. / : , 

FIG. 7. Relative quantitative RT-PCR™ showing that UC33 1 mRNA is roughly seven 
times more abundant in the peripheral blood of individuals with recurrent metastatic breast or 
prostate cancer compared to UC331 mRNA levels from healmy volunteers. PCR™ amplification 
of a UC33 1 specific cDNA fragment was performed using the same pools of fi-actin normalized 
cDNAs as templates. PCR™ reactions were terminated after cither 25^ 28 or 31 cycles. Pools of 
cDNAs were constructed ^mpeiipheral blood RN As from eight heialthy vbiu^feers (N); ten 
individuals with recurrent metastatic prostate cancer (P), or ten individuals with recurrent r 
metastatic breast cancer (B). The' intehsity of the bands are proportional to the relative amounts 
of UC33 1 mRNA in the individuals from which these cDNA pools were constructed. 
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FIG 8 PCR™ amplification of a UC332 specific eDNA fragment using the saihe Jiobls 
of normalised cDNAs as templates. PCR tM reactions were terminated after either 25, 28 or 31 
cycles. Pools of cDNAs were constructed from peripheral blood RN As from eight healthy 
volunteers (N), ten individuals with recurrent me^tetic prostate cancer (P), or tein individuals 
wiffi rectoent ifn^tastatic breast cancer (B). The ihtbnsity of the bands are proportional to the 
relative amounts of UC332 mIOQA in the ittdividuals from which these cDNA pools were 
cbfetrubted. > ^ 



Terms used: , „ 

HK2: .. human kallekrein 2 gene product 

PAP: , prostatic acid phosphatase 



PSA: 
15 PSMA: 
PSP 94 : 



t-PSA:; . W^-v. .tbtaJ PSA^, . 



f/t (Free/Total PSA)* ratio of free to total PSA, measured in serum specimens; with moderately 
" elevated t,PSA . / ., ■ ^i^S.i^U^ -.vc- V-.wV - ■->■ 

20 IkfS: , , v ; ; Interleukin-8 (UC 325) i? ? - ; , r 

v T ^ ^^Sarjiypif ^^rue P^sitiyes/(True Positives + F^se Negatives); plotted ottyrmis of ROC 



curve. 



^ ^ Negatiyes)/(Tnie Negatives + False Positives); plotted on y x x axis (as 7- 

-\,V : -;v-,..-ri ,. - Specificity) of ROG curve, ^:.,^- v .. ■ 

25- i .w- '^R0Gi • .v, Receiver Operator Character Curve; a means of plotting ^sensitivity and 

specificity over a i^ge of cy^^ 

StageACaP: organ-confined clinical stage of prostate cancer in which tumor is hot 

30 palpable by a digital rectal exam (DRE) (Walsh & Worthington, 1 995). 
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Stage B CaP: organ-confined clinical stage of prostate cancer in which tumor is palpable 

by a digital rectal exam and involves one or both lobes of the gland 

(Walsh & Worthington, 1995). 
Stage C CaP: non-organ-confined clinical stage of prostate cancer in which tumor is 

palpable by a DRE and invades beyond the capsule and/or the seminal 

vesicles (Walsh & Worthington, 1995). 
Stage D CaP: non-organ-confined clinical stage of prostate cancer characterized by 

metastases to lymph nodes, bone or other distant organ site (Walsh & 

Worthington, 1995). 

The present invention concerns the early detection, diagnosis, and prognosis of human 
disease states. Markers of a disease state, in the form of isolated nucleic acids of specified 
sequences from the peripheral blood of individuals with the disease state, are disclosed. These 
markers are indicators of the disease state and are diagnostic for the presence of the disease state in 
patients. Such markers provide considerable advantages over the prior art in this field. Since they 
are detected in peripheral blood samples, it is not necessary to suspect that an individual exhibits 
the disease state before a sample may be taken. The detection methods disclosed are thus suitable 
for widespread screening of asymptomatic individuals. Further, the methods provide for sensitive 
detection of disease state markers that is relatively unaffected by the presence of normal, non- 
diseased cells in a biological sample such as peripheral blood. 

It will be apparent that the nucleic acid sequences disclosed will find utility in a variety of 
applications in disease state detection, diagnosis, prognosis and treatment. Examples of such 
applications within the scope of the present disclosure comprise amplification of markers of the 
disease state using specific primers, detection of markers of the disease state by hybridization with 
oligonucleotide probes, incorporation of isolated nucleic acids into vectors, expression of vector- 
ihcorporated nucleic acids as RNA and protein, and development of immunologic reagents 
corresponding to marker encoded products. j-;.^' :;}■■■■ -r' 

It is important to note that UC-^25 (Il>8) serology hi combination with PSAahdfTtPSA 
can more accurately differentially diagnose prostate cancer and BPH. This method' provides 
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significant advantages over previous methodologies for detecting prostatic cancer, which often 
failed to differentiate between prbstatic cancer and BPH. 

A. NiicleicAcids ^ : ; r y « 

5 As described in Examples 1 through 4, the present disclosure provides five markers of a 

^disease state, identified by RNA fingerprinting. These include two previously previously 
unchafacterized gene products, as well &s nucleic acid products of the EL-* (interieukih 8) and 
human elongation factor 1 -alpha genes. 

In one embodiment, the sequences of isolated nucleic acids disclosed herein find utility as 
10 hybridization probes or amplification primers. These nucleic acids may be used, for example, in 
^diagrio^tic evaluation' of tissue samples or employed to Clone full length cDMAs or genoiriic clones 
corresponding thereto. In certain embodiments, these 

fragments. Such fragments are of sufficient length to provide specific hybridization to an RNA or 
DNA sample extracted from tissue. The Sequences typically will be 10-20 nucleotides, but may be 
15 longer. Longer sequences, e g., 40, 50, 1 00, 500 and even up to full length, are preferred for certain 
embodiments. 

Nucleic acid ihblecul^ 15 y 17, 20, 30, 40, 50, 60, 

75 or 100 or 500'hucleotides of : a sequence comprising Genebank Accession numbers D87451, 
t0301 3, X03558, M28130, ¥00781?, SEQ ID NQ: 1^ SEQ ID *J©:2, SEQ H3> NO:3> SEQ ID NO:4, 
20 SEQ ID NOt5, or SEQ ID NO 29 are contemplated. Moleculeg that are compleriientaiy to the 
above mentioned sequences andthat bind to these sequences titode^Mgh 5 ^ are 
also contemplated. These probes are usefft in>&%i^ as 
' • Sbutherh kid northern blotting. In some caSes, it is ebhteiiipl^ted tHat^ may be used that 
" hybridize to nriiltiple target sequences withotit 
25- ; ""^^^e^stafe;. :; -'wlU^^ •■ } ^: -; 

% i V^otts probes and primers may be d^gfted ^rcmrid ^ the discl&&d nucleotide sequences. 
Primers ffeSy be 6f any lefrgth biit^ typically, are 3<)fe20 bases m length. OBy ^%ning tumeric 
values to a sfequenee, for example, the first re^idtieis 1, the second residue is % etc , te^gorithm 
A definingM^prin^ers may be ptt>posed: ' ' 't; • h 

30 ntoh + y 
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where n is anjnteger from 1 to the last number of the sequence and y is the length of the primer 
minus one (9 to 19), where n + y does not exceed the last number of the sequence. Thus, for a 10- 
mer, the probes correspond to bases 1 to 1 0, 2 to 1 1 , 3 to 1 2 ... and so on. For a 1 5-mer, the probes 
correspond to bases 1 to 15, 2 to 16, 3 to 17 ... and so on. For a 20-mer,the probes correspond to 
bases 1 to 20, 2 to 21 , 3 to 22 ... and so on. 

The values of n in the algorithm above for each of the nucleic acid sequences is: S EQ ID 
NO:l,n~253; SEQ ID NO:2, n- 183 ; SEQ ID, NO:3, n= 387; SEQ ID NO:4, n= 366; SEQ ID 
NO:5,n=598. , 

In certain embodiments, it is contemplated that multiple probes may be used for 
hybridization to a single sample. For example, an alternatively spliced form of IL-8 mRNA, 
containing intron 3, may be detected by probing: hiunan tissue samples with oligonucleotides 
specific for intron 3 and for exoxi portions of the IL-8 transcript Hybridization with the intron 3 
and exon sequences probe would be indicative of a normal individual and binding to only the exon 
probe would be indicative of metastatic prostate cancer. 

The use of a hybridization probe of between 17 and 100 nucleotides in length allows the 
formation of a duplex molecule that is both stable and selective. Molecules having complementary 
sequences over stretches greater than 20 bases in length are generally preferred in order to increase 
stabiiitytand selectivity of the hybrid,^ 

molecules. It is generally preferred to de§i^ nuqleic acid nlolep to 30 

nucleotides, or even longer. Such fegments may be readily prepared t>y, for ex^ple^ directly } { 
ius^^esmrig ^e- ffeigment by chemical means ,pr ; by introducing selected s^uences into 
jrecombdn^ ? ^ . ...... 

^ The compli^ aqid sequence, is well known in the art ami is, baged on the 

i^atiti-paj^efe: ^ pf nucleotides (bases) for a -given niicleict acid tp^lymer 

(strand). Two complementary strands of DNA are formed into a duplex by^2uaring Q^bases, e^g. 

ofJRNA) and all "V 

^mpimrnn^ defines a second p^^gleic acid, wggfc^ stand of 

nucleic acid to form a duplex molecule in which base pairs, are matched as ^ 0:C^Q:<3^^:l7U or 
T/U:A. 
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A complement may also be described iis a fragment of E)NA (nucleic acid segment) 1 or a 
synthesized single stranded oligomer that may contain small mismatches or gaps when 
hybridized to its complement, but that is able to hybridize to the complementary DIsfA under 
high stringency conditions. To hybridize is understood to mean the forming of a double stranded 
5 molecule or a molecule with partial double stranded nattire. High stringency conditions are those 
that allow hybridization between two homologous nucleic atcid sequences, but precludes 
hybridization of random sequences. For example, hybridization at low temperature and/or high 
ionic strength is termed low stringency. Hybridization at high temperature and/or low ionic 
strength is lenned hiyh stringency. Low stringency is generally performed at 0.15 M to 0.9 M 

10 NaCl at a tenVperature range of 20°C to 50°C. High stringency is; generally perforihed at 0.02 M 
to 0 15 M NaCl at a temperature range of 50°C to 70°C. It is understood that the temperature 
and ionic strength of a desired stringency are determined in part by the length of the particular 
probe, the length and base content of the target sequences, and to the presence of formamide, 
tetramethylammonium chloride or other solvents in the hybridization mixture. It is also 

15 understood that these ranges are mentioned by way of example only, and that the desired 
stringency for a particular hybridization reaction is often determined empirically by comparison 
to positive and negative controls. 

Accordingly, the nucleotide sequences of the disclosure may be used for their ability to 
selectively form duplex molecules with complementary stretches of genes or RNAs or to provide 

20 primers for amplification of DNA or RNA from tissues. Depending on the application envisioned, 
it is preferred to employ varying conditions of hybridization to achieve varying degrees of 
selectivity of probe towards target sequence. 

For applications requiring high selectivity, it is preferred to employ relatively stringent 
conditions to form the hybrids. For example, relatively low salt and/or high temperature 

25 Conditions, such as provided by about 0.02 M to about 0. 1 0 M NaCl at temperatures of about 50°C 
to about 70°C. Such high stringency conditions tolerate little* if any, mismatch between the probe 
and the template or target strand, and would be particularly suitable for isolating specific genes or 
detecting specific mRN A transcripts. It is generally appreciated that conditions may be rendered 
more stringent by the addition of increasing amounts of formamide. 
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For certain, applications, for example, substitution of amino acids by site-directed 
mutagenesis, it is appreciated that lower stringency conditions are required. Under these 
conditions, hybridization may occur even though the sequences of probe and target strand are not 
perfectly complementary, but are mismatched at one or more positions. Conditions may be 
rendered less stringent by increasing salt concentration and decreasing temperature. For example, a 
medium stringency condition may be provided by about 0.1 to 0.25 M NaCl at temperatures of 
about 37°C to about 55°C, while a low stringency condition may be provided by about 0.15 M to 
about 0.9 M salt, at temperatures ranging from about 20°C to about 55°C- Thus, hybridization 
conditions may be readily manipulated depending on the desired results. 

The following codon chart may be used* in a site-directed mutagenic scheme, to produce 
nucleic acids encoding the same or : 
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TABLE 1 



Amino Acids 



Codons 



Alanine * 


Ala 


A 


GCA 


GCC 


QCG 


GCU 






j Cysteine -"-='•' 


Gys 


c 


UGC 


UGU 










A<*n?rrtic *icid 


Asp * 


D 


GAC 


GAU 








;• i-\ :* ■'*' ;:■ 


Glutaini e acid £ 


Glu 




GAA 


GAG 










: ■ x i It J Ijr luial JJ * -1 V ; 


Phe 

A Alls 


F "• 


JJUC 


uuu 






.... _ . . . . . 








VJ.. 


GGA 


GGC 


GGG 


GGU 






nibuuiJic 
--^ isoieuGine? ; 




n 


CAG 


CAU 




. > ' " > 3> - ■ ■■ 






-; lie •, , . 


1 


AU A 


AUC 


AIJll 

- f»vM. 






-; ? "I ' : 


i>ysine . .. . 


Lys •• 




- AAA 


AACi 

AAV 










Leucine 


Leu 


L 


tiiiA 

: U Uri 




n ia 








Methionine 


Met j 


"Nil 


AI If? 












>\!5paragine 




IN '. . 


AAC 


AAU 










Proline* 


Pro 


P 

X : 


CCA 


GCC 


CCG 


ecu 






Glutaixiine 


Gin 


Q 


GAA 


CAG 










*- Asinine ;■ 


. ■ Arg ' 




AGA 


A^ ; 


CQA 


/.,:-fGC, v 


CGQ 


CGU 


Serine 


Ser 


s 


AGC 


AGU 


UCA 


ucc 


UCG 


UCU T 


[ v ' ; ^IlH^iiiiie; : ;; 




:t)v.. 




AOC 


ACG 


.^HACU.;..,/ 












*y- GUA ; 


GUC • 


GUG 


GUU 


■ ^ f » 




Tryptophan •• > 






; UGG 








■ *:"-* "'. A- 

'\ r,.; 
>. .• ■ •: ; 










,-■ & :rV ^ UAC 






i,./. .. • 


( "■";;>{..; 





In other embodiments, hybridization may be achieved under conditions of, for example, 50 
5, inMTris-HCl (f>H 5.3)^75 mM^Gl, 3 J mM^MgCfe* 10 mMdithiothreitol, at temperatures be 

approximately 20°C to aboyt 37°C. Other hybridization conditions utilized may include 
approxin^a|ely 10 mftf Tris.HCl, (plj 8.3), 50 mM KG1> L5 mM MgCl2, at temperatures ranging 
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In certain embodiments, it is preferred to employ isolated nucleic acids of the present 
disclosure in combination with an appropriate means, such as a label, for determining 
hybridization. A wide variety of appropriate indicator means are known in the art, including 
fluorescent, radioactive, enzymatic or othbr ligands, such as avidin/biotin, which are capable of 
5 being detected. In preferred embodiments, one may employ a fluorescent label or an enzyme tag 
such as urease, alkaline phosphatase or peroxidase, instead of radioactive or other environmentally 
undesirable reagents. In the case of enzyme tags, colorimetric indicator substrates are known which 
may be employed to provide a detection means visible to the human eye or spectrophqtometrically, 
to identify specific hybridization with complementary nucleic acid-containing samples. 

1 0 In -general, it is contemplated that the hybridization probes described herein are useful both 

as reagents in solution hybridization, as in PGR, for detection of expressionof corresponding genes, 
as well as in embodiments employing a solid phase. In embodiments involving a solid phase, the 
test DNA (or RNA) is adsorbed or otherwise affixed to a selected matrix or surface. This fixed, 
single-stranded nucleic acid is then subjected to hybridization with selected probes under selected 

1 5 conditions. The selected conditions depend on the particular circumstances based on the particular 
criteria required (depending, for example, on the G+C content, type of target nucleic acid, source of 
nucleic acid, size of hybridization probe, etc). Following washing of the hybridized surface to 
remove non-specifically bound probe molecules, hybridization is detected, or even quantified, by 
means of the label. 

20 It is understood that this disclosure is not limited to the particular probes disclosed herein 

and particularly is intended to encompass at least isolated nucleic acids that are hybridizable to 
nucleic acids comprising the disclosed sequences or that are functional sequence analogs of these 
nucleic acids. For example, a nucleic acid of partial sequence may be used to identify a 
structurally-related gene or the full length genomic or cDNA clone from which it is derived. 

25 Methods for generating cDNA and genomic libraries which, may be used as a target for the above- 
described probes are kiK>\vn in the art (S^brook el ai , 1089). 
• ^ ^F6r application^^ nucleic acid s^^rientis of the present disclbstiire are 

ihccxrpb^^ such ais plasmids, fedsmids of \arus^s, fhese segments mSy be combined 

:"^ : -f-w sequehces, such as^jjninimers, poiy^dy^Iiation j^^nids, restrict!*^ sites, 

30 multiple cloning sites, other coding segments, and the like, such that their dVer&il lengidi may vary 
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cohsiderably. It is contemplated that a nucleic acid ^gment of almost any length may be 
employed, with the total length preferably being limited by the ease of preparation and use in the 
intended recombinant DNA protocol, ' ; - 4r . 

DNA segments encoding a specific gene rriay be introduced into recombiiiihthost cells for 
5 expressing a specific structural or regulatory protein. Alternatively, through the application of 
"genetic engineering techniques, subportioiis or derivatives of. selected genes iray-be employed. 
Upstream regions containing regulatory regions such as: prorikrter regions may be isolated and 
subsequently employed for expression of the selected gene. 

Where an expression product is to be generated, it is possible for the nucleic acid sequence 
10 to be varied while retaining the ability to encode the same product; Reference to the cddbn chart, 
provided in Table 1, enables the design of any nucleic acid encoding the same protein or peptide 
product 

B. Encoded Proteins 

15 Onde the entire coding ^ueticetff a m has b^en determined, the gene 

rriay be inserted into an appropriate expression system. The gene may be expressed in any number 
of different recombinant DNA expression systems to generate large amounts of the polypeptide 
pfrbduct, whjoh rii^ which may 

also be us&fiil in the practice of the disclosed invention; Fdr fekample; polyclonal 

20 antibbd^^ t; 

"gene. &ucii 

diagnosis of the disease state in peripheral blood or other tissue samples. > * 

,v • in the art ^^ fa^ coli, yeast 

' kifch aS Pidhia pt&^ 

25 £6^ c^Cl^ coll 
' %nd in baeulovihis expression sy^ems; A coni^Iete gene may %e expressed or* alternatively* 

In oiie^ ^ tlie gehe |e^u^ 

putative transmembrane sequences. Such sequences are typic^ly very hydfc^^ 
}0 detected by the use; of sequence analysis software, such as Lasergene (DNAstar, Madison, WI). 
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The presence of transmembrane sequences is often deleterious when a recombinant protein is 
synthesized in many expression systems, especially E. coli> as it leads to the production of insoluble 
aggregates that are difficult to renature into the native conformation of the protein. Deletion of 
transmembrane sequences typically does not significantly alter the conformation of the remaining : 
5 - protein structure. 

Moreover, transmembrane sequences, being by definition embedded within a membrane, 
are inaccessible. Antibodies to these sequences will not prove useful for in vivo or in situ studies. 
Deletion of transmembrane-encoding sequences from the genes used for expression may be 
achieved by conventional techniques. For example, restriction enzyme sites may be used to excise 
10 the desired gene fragment, or PCR-type amplification may be used^ to amplify only the desired part 
of the gene. 

In another embodiment, computer sequence analysis is used to determine the location of 
predicted major antigenic determinant epitopes of Jthe polypeptide. Software capable of carrying 
out this analysis is readily available commercially. Such software typically uses conventional 
15 algorithms such as the Kyte/Doolittle or Hopp/Woods methods for locating hydrophilic sequences 
which are characteristically founcj^on the surface of proteins and are, therefore, likely to act as 
antigenic determinants. • . 

+ Once v this analysis is made, polypeptides may be prepared which contain at least the 
essential features of the antigenic determinantand which may be employed in the generation of 
20 antisera against the polypeptide. Minigenes or gene fusions encoding these determinants may be 
constructed and inserted into expression vectors by conventional methods, for example, using PCR 
cloning methodology. ; ... . r v v 

,, A gene or gene fragment encoding a polypeptide rnay be inserted into an e^ession vector 
by ; conventional subcloning techniques. In one embodiment, an R cqli expression veptpr is used 
25 , whichprodiuces the recom^n^tj^I^^ fusion protein, allowing rapid affinity purification 

- >pf;4l>e protein.: Exann^es or the; j^utetWone S- 

transferase system (Pharmacia, £iscata^ 
^ ©e^eriey^ MA), the FLAG system New Haven* CT), anjd ;Jhe #*His system (Qiagen, 
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Some of these systems produce recombinant polypeptides bearing only a small number of 
additional amino acids, which are unlikely to affect the antigenic character of the recombinant 
polypeptide. For example, both the FLAG system and the 6xHis system add only short sequences, 
both of which are known to be poorly antigenic arid which do riot adversely affect folding of the 
5 polypeptide to its native conformation. Other fusion systems produce polypeptide where it is 
Mesfrable to excise the fusion partner from the desired polypeptide. In brie embodiment, the fusion 
partner is linked to the recombinant polypeptide by a peptide sequence containing a specific 
recognition sequence for a protease. Examples of suitable sequences are those recognized by the 
^Tobacco Etch Virus protease (Life Technologies, (Saithersbuf g, MD) or Factbr Xa (New England 
10 "Biolabs, Beverley, MA). 

In another embodiment, the expression sysrtern used is one driven by the baculovirus 
polyhedron promoter. The gene encoding the polypeptide riiay be manipulated by conventional 
techniques iri order to facilitate cloning into the baculovirus Vector. One baculovirus vector is the 
pBlueBac vector (Invitrogeh, Sorrento, GA). The vector carrying the gene for the polypeptide is 
15 transfetted mto Spodoptera Jh^giper^a (S&) cells by conventional protocols, and the cells are 
icriltrired arid processed to produce the recombinant antigen. See Summers e/ al.^ A MANUAL OF 
METHdDS FOR BACULOVIRUS VECTORS AND INSECT CELL CULTURE 
PROCEDURES, Texas Agricultural Experimental Station; U.S. Patent No. 4,2 15,05 1 (incorporated 




20 As an alternative to recombinant pbl^eptides^ synthetic peptides Corresponding to the 

antigenic determinants may be prepared. Such peptides are at least six amino acid residues long, 
aincf tt^y cbritaih up to approximately 50 J residues, which iSfthe Approximate upper length limit of 
%^ Such a$ those available from Applied BiOsystems (Foster 

CA): te^f si^ sm^il peptides for vaccination^^ 
25 v tb im^im as hepatitis B surface Mfigeh, keyhole limpet hemocyanin 

^^^^^^ ^ knbWhin^theart 
~ ^ ' t< < jrfy&ne ^^di&eht, airiitio adift seJg^e^^ 

? A 11iete may, fbr instance, be minorsequeric^^ which arise due to natural 

variation within the population or they may be homologues found iri; other species. They also may 
30 be sequences which do not occur naturally but which are sufficiently similar that they function 
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similarly and/or elicit an immune response that cross-reacts with natural forms of the polypeptide. 
Sequence variants may be prepared by conventional methods of site-directed mutagenesis such as 
those described above for removing the transmembrane sequence. 

Amino acid sequence variants of the polypeptide may be substitutional, insertiorial or 
5 deletion variants. Deletion variants lack one or more residues of the native protein which are not 
essential for function or immunogenic activity, and are exemplified by the variants lacking a 
transmembrane sequence describe^ above. Another common type of deletion variant is one lacking 
secretory signal sequences or signal sequences directing a protein to bind to a particular part of a 
cell. An example of the latter sequence is the SH2 domain, which induces protein binding to 
10 phosphotyrosineresidues. 

Substitutional variants typically exchange one amino acid fpr another at one or more sites 
within the protein and may be designed to modulate one or more properties pfflie polypeptide, such 
as stability against proteolytic cleavage. Substitutions preferably are conservative, that is, one 
amino, acid is replaced with another of similar shape and charge. Conservative substitutions are 
15 well known in the art and include, for example, the changes of: alanine to serine; arginine to lysine; 
asparagine to glutamine or histidine; aspartate to glutamate; cysteine to serine; glutamine to 
asparagine; glutamate to aspartate; histidine to asparagine or glutamine; isoleucine to leucine or 
valine; leucine to valine or isoleucine; lysine to arginine or glutamine; methionine to leucine or 
isoleucine; phenylalanine to tyrosine, leucine or methionine; serine to threonine; threonine to 
20 serine; tryptophan to tyrosine; tyrosine to tryptophan or phenylalaiiine; and valine to isoleucine or 
jjracine. • ".v.---" 

^ iBseitipn^ yw^nts include fusion proteins such as those used to allow rapid purification of 
the polypeptide and also may include hybrid proteins containing sequences from other homologous 
A ', pipteins and polypeptides. For example, an insertional variant may include portions of the amino 
2? V acid sequence of the, polypeptide from one species, together with portions of the homologous 
P^lj^ptide^ Other insertional variants may include those in wMch a^ditioiial 

^ ; ^ sequence of the ppljrp^ptide. These typically are 

trailer insertions fhap th^ fusion proteii^ des above, and 'are introduced* for ex^paple, to 
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In one embodiment, major antigenic determinant^ of the polypeptide are identified by an 
empirical approach in which portions of the gene encoding the polypeptide are expressed in a 
recombinant host, and the resulting proteins tested for their ability to elicit an immune response. 
For exanrple, PGR may be used to prepare a range of peptides lacking successively longer; 
5 ^fragments of the C-terminus of the protein. iTie^ triunundprotective activity of dach of these y 
^peptides then identifies those fragments; or domains of the polypeptide which are essential for this 
activity. FiMther studies in which oiily a small nuihber of amino; acids are removed at each iteration 
then enables the location of the antigenic determinants of the polypeptide. 
*V Another embodiment for the preparation of polypeptides accbkiirig to the disclosure is the 
10 use of pieptide mimfetics. Is4imetics are peptide^ of ' . , 

^roteiti secondary structure. See, for example, Johnson e/ al, "Peptide Turn Mimetics" in 
BIOTECHNOLOGY AND PHARMACY, Pezzuto et all Eds., Chapman and Halt; New York 
(1993). The underlying rationale behind the use of peptide mimetics is that the peptide backbone of 
prbteins exists chiefly to orient amino acid side chains in such a way as to facilitate molecular 
15 interactions, such as those of antibody and aritigeh. A jieptide mimetic is expected to permit 7 . . 
molecular iiiidractions similar to the natural molecule. 

Successful applications of the j^jptide miniefic concept h^ 
of p-tums within proteins, which are known to be highly antigenic. Likely p-turn stirticturef within 
an polypeptide may be predicted by computer-based algorithms as discussed above. Once the 
20 component amino acids of the turn are deteisnined, peptide m^ cohstructed to achieve 1 

a similar spatial orientation of the essential elemeiit^ of the anaind acid sidechains. 

^CI Prepa rat ion of Antibodies Specific for Encoded Proteins 

25 The cDNiA& ^ ; 

■■ 

^ SE(fli9 ^0:5, II) HO or .prbteins. The 

* ' ^hgirfeetihg of DN& ^gta<£ht(s) for eXpressicM'ih a pf #karyotic or eufcaryotic system may be 
^etforriied by fethniques generally known in the aft of i^cbmbinant expression. It is believed that 
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virtually any expression system may be employed in the expression of the claimed isolated nucleic 
acids. 

Both cDNA and genomic sequences are suitable for eukaryotic expression, as the host cell 
generally processes the genomic transcripts to yield functional mRNA for translation into proteia 
In addition, it is possible to use partial sequences for generation of antibodies against discrete 
portions of a gene product, even when the entire sequence of that gene product remains unknown. 
Computer programs are available to aid in the. selection of regions which haye potential 
immunologic significance. Software capable of canning out this analysis is readily available 
commercially, for example MacVector (TBI, New Haven, CT). The software typically uses 
conventional algorithms such as the Kyte/Doolittle or Hopp/Wpods methods for locating 
hydrophilic sequences which are characteristically found on the surface of proteins and are 
therefore: likely to act as antigenic determinants. 

It may be more convenient to employ as the recombinant gene a cDNA version of the gene. 
It is believed that the use of a cDNA version provides advantages in that the size of the gene is 
generally much smaller and more readily employed to transfect the targeted cell than a genomic 
gene, which is typically up to an order of magnitude larger than the cDNA gene. However, the 
possibility of employing a genomic version pf a particular gene or fragments, thereof is specifically 
contemplated. ; . ..... 

y As use*! herein, the terms "ejngmeered" and "f^^fei^t" cells are intended to refer to a 
cell into whioh an ejcogenpiis WSIA segment or gene, such as a cDNA or gene has been introduced. 
Therefore, engineered c^lls a^ ^istinguishabl 

a recombinant^ introduced exogenous DNA segment or gene. Engineered cells are thus cells 
having a gene or genes u^<^^ 

having an introduced cDNA or genomic gene, and also include genes positioned adjacent to a 
promoter no^ t ^ 

j ^ wildrtype, in 

ltdisclaj^e^ 

l^prppCTativ^ 

a ceding ^equ^n^^ p^pmpter, or tp "p^ratiyely Jiflk" to a 

promoter, one positions the 5 f end of the transcription initiation site of the transcriptional reading 
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frame generally between about 1 arid about 50 nucleotides "downstream" of (/.e., 3* of) the chosen 
promoter. The "upstream" promoter stimulates transcription of the DNA and promotes expression 
of the Encoded recombinant protein. This is the meaning of "recombinant expression" in this 
context 

Many conventional techniques arc available to construct expression vectors containing the 
Appropriate nueleic acids and transCriptional/transIatfori^^ in order to achieve 

£ix>teiii or peptide: fekpfession in a variety of host-ekpression systetris. Cell types available for 
expression iiicltide, but are hot limited to, bacteria, such as E: coli m\d B. subtilw 
recombinant bacteriophage DN^ 

Certain examples of prokarybtic hosts are E' coli straiin RR1, & coli LE392, El coli B, 
f ±coliX 1776 (ATCC No. 3 1537) as well as E coli W3110 (F-, Imiibda^, prototrophic, ATCC No. 
27332:5); bacilli such as Bacillus subtilise and Other entefobacteriaceae such as Salmonella 
i^himuHum^Serraiia marcescens y and various Pseudomonas species. 

In general, plasmid vectors containing rejplicon and control "sequences which are derived 
from species compatible with the host cell are used iri connection With these hosts. The vector ; 
ordinarily carries a replication site, as well as marking' sequences Which are capable of providing 
phehotypic selection in transformed cells. For example, K coli is ofletiti^sfo pBR322, 
a plasmid derived from an E coli species. pBR322 contains geftes for ampicUlin and tetracycline 
r resistance and thus provides easy means for identifying transforined cells. The pBR plaSihid, or 
Other microbial plasmid or phage must also contain* or he iAbdified to contain, promoters which 
may be used by the microbial organistti for expression of its OWta proteins. 

In addition, phage vectors containing replicon and control s^uoic&s thki ^e c^inpatible 
Srifthhe host mictoorg^sto may be used as transforming vectors in connection Vtfth these hosts. 
v To* GEM™-! I may be^tilizedfflirMihg a recoihbi^aat^iage vector 

whi<Stmayi^ suchas £.Vo/ii;E392; > n 

> ^ J Ftti&er iisSful Vector^ iriciude pIN vectors (Inoiiye ^if dl\ 1985i)i arid pGEX vectors, for use 
ih^gm^^ j>^0teifis for later pMfibation and 

SCpdura^^ Other suitable fusion ptoteSfe ^ with feg^iet^ or 
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Promoters that are most commonly used in recombinant DNA construction include the 0- 
lactamase (penicillinase), lactose and tryptophan (trp) promoter systems. While these are the most 
commonly used., other microbial promoters have been discovered and utilized, and details 
concerning their nucleotide sequences have been published, enabling their ligation into plasmid 
5 vectors. 

For expression in Sgccharomyces, the plasmid YRp7, for example, is commonly used. This 
plasmid already contains the //pi gene which provides a selection marker for a mutant strain of 
yeast lacking the ability to grow in tryptophan, for example ATCC No. 44076 or PEP4-1. The 
presence of the trp\ lesion as a characteristic of the yeast host cell genome then provides an 
10 effective environment for detecting transformation bygrowth in the absence of tryptophan. 

Suitable promoting sequences , in yeast vectors include the promoters for 3- 
phosphoglyceratekinase or other glycolytic enzymes, such as enolase, glyceraldehyde-3.phosphate 
dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate 
isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, 
15 phosphoglucose isomerase, and glucokinase. In constructing suitable expression plasmids, the 
termination sequences associated with these genes are also ligated into -the expression vector 3' of 
the sequence desired to be expressed to provide polyadenylationof the mRNA and termination. 

Other suitable promoters, which have the additional advantage of transcription controlled 
by growth conditions, include the promoter region for alcohol dehydrogenase 2, isocytochrome C, 
20 acid phosphatase, degradative enzymes, associated with nitrogen metabolism, and the 
aforementioned glycei^dehyde-3 r phosphatedehydrogenase, and enzymes responsible for maltose 
3« d galactose utilization. ,. . ;i , : f . - t 

: In,additionto nticro^rganisms, cultures of cells derived from multic^Uul^ organisms may 
also be used; as hpsts. Jn principle* any such cell culture is workable, whether from vertebrate or 
25 invertebrate culture. In additionto mammalian cells, these include insect cell systems infected with 
^ systems infected with 

n*e«e^bw C^yfV; tobacco mosaic virus, 

pt ti^sfonned with rec^m^n^t plasinid exp (e,g., Ti p]asm»d) containing 

one or more coding sequences. ^ 



fin :.t'2i 
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In a useful insect system, Autographic* calsfornica nuclear polyhidrosis virus (AcNPV) is 
used as a vector to exprfess foreign genes. The virus grows in Spodoptera frugiperda celte. The 
isolated nucleic acid coding sequences are cloned into nonessential regions (for example the 
p61yhedriii gene) of the virus and placed under control of an AcNPV promoter (for example the 
5 polyhedrin prorhoter). Successful insertion of the coding sequences results in the inactivation of 
Ithe polyhedrin gene and production of non-occluded recombinant virus (z.e., virus lacking the 
proteinaceous coat coded for by the polyhedrin gene). These recombinant viruses are then used to 
infect Spodoptera frugiperda cells in which the inserted gene is expressed (e.g.; U.S. Patent No. 

10* Examples of useful mammalian host cell lines are VERO and HeLa cells, Chinese hamster 
^vaiy (CHO) cell lines, Wl 38, BHK, COS-7, 293, tiepG2, 3T3, 1UN ahd MDCK cell lines. In 
addition, a Host cell strain may be chosen that modulates the expression of the inserted sequence, 
or modifies and processes the gene product in the specific fashion desired. Such modifications 
(e.g., glycosylation) and processing (e.g., cleavage) of protein products may be important for the 

15 function of the Encoded protein: 

Different host ciells have characteristic and specific mechanisms for the post-translational 
processing and modification of proteiins. Appropriate cells lines or host systems may be chosen to 
help ensure the correct modification and processing of the foreign protein expressed. Expression 
vectors for use in mammalian cells ordinarily include an origin of replication, a promoterlocated in 

20 frorit of the gene to be expressed, along with ^y necessary ribosbme binding sites, RNA splice 
sites; polyadenylatiori site, and transcriptional terminator sequences. The origin of replication may 
b6 provided either by construction of the vector to include ail ekogehdus origin, sudh as may be 
tderived from SV40 or other viral (e.g 9 Polyoma^ i^rio, ^ be provided 

by the hbst cell chromosoiiial replication mechanism If the Vector is integrated into the host cell 

25 tlkomosbme, the latter is often suffi 

v '& The promoters maybe derived from the genohie of mammalian cells (e.g., IhetMiothionein 
|)f ©iiioter) 6i fronf mariiittalian viruses (e.g., the adenovirus late promoter; the Vaccinia virus 7.5K 
f pjforiiotibf). I%ther, it is also possilde 

with the g^e sequ^ceW interest, prdvided $iich control sequences are compatible with the host 
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A number of viral based expression systems may be utilized. For example, commonly used 
promoters are derived from polyoma, Adenovirus 2, and most frequently Simian Virus 40 (SV40). 
The early and late promoters of S V40 virus are particularly useful because both are obtained easily 
from the virus as a fragment which also contains the SV40 viral origin of replication. Smaller or 
5 larger SV4Q fragments may also be used, provided there is included the approximately 250 bp 
sequence extending from the Hind III site toward the Bgl I site located in the viral origin of 
replication. 

In cases where an adeno virus is used as an expression vector, the coding sequences may be 
ligated to an adenovirus transcription/ translation control complex, e.g., the late promoter and 
10 tripartite leader sequence. Thi s chimeric gene may then be inserted in the adenovirus genome by in 
vitro or in vivo recombination. Insertion in a non-essential region of the viral genome (e.g., region 
El or E3) results in a recombinant virus that is viable and capable of expressing proteins in infected 
hosts. 

Specific initiation signals may also be required for efficient translation of the claimed 
15 isolated nucleic acid coding sequences. These signals include the ATG initiation codon and 
adjacent {sequences. Exogenous translational control signals, including the ATG codon, may 
additionally need to be provided* This need is readily determinable and the necessary signals 
readily provided. It is? well known that the initiation codon must be in-frame (or in-phase) with the 
reading fi^ime of the desired coding sequence to help ensure translation of the entire insert These 
20 exogenous translational control signals and initiation cpdons may be of a variety of origins, both 
natural and ;^th^ of expression may be enhanced by the inclusion of 

^ 1937). 

incorporate into the ti^i^eriptipnal unit 
v ,.. ; ; an appropriate polyadenylatipn site (e.g., 5>AAJAAA-3') if one was not contained within the 
25 original cloned segment Typically, the poly A addition site is placed about 30 to 2000 nucleotides 
"dp^ynstream^of the termination site of the protein at a position prior to transcription termination, 
n of recombin^t j>rpteins, stab|e expr^ion is 

^^^^^ ^ For example, celt lines that stably expr^s constructs encoding prp^^ way be 

' ; : €?figfe^^ using egression vectors that contain viral origins of rej^catk^ host cells 

30 may be transformed with vectors controlled by appropriate expression control elements (e.g., 
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promotei- or enhancer sequences^ transcription terminators, pblyadehylatiori sites, etc.); and a 
selectable marker. Following the introduction of foreign DNA, engineered cells may be allowed to 
grow for 1-2 days in an enriched medium and then are switched to a selective media. The 
selectable itiarker in the recombinant plasmid confers resistance to the transformant and allows 
5 bells tc> stably integrate the plasriiid into their chromosomes and grow to form foci which in turn 
■ri&y be cloned and expanded into* cell lines. 

A numfvr o t selection systems may be u^d, including^ 'but* not limited, to the herpes 
simplex \inis thymidine kinase (Wigleir et iril, \97t), ; hypoxanthine-guanine 
4 pWosphi>riK>N> Itmhsfcrase (Siybalska et aL, 1 962) and adenine jphosphdribosyjtransferase genes 

10 (Lbwy ct ul\ I c >K0). in ifc-; hgprt* --tit aprt- <:ells, respectiveily; Alsd, antimetabolite Resistance may - * 
bS'used as thV tidsis of selection for dfrfir, that confers resistiance to methotrexate (Wigler et al y 
1980; O'lTare et ul 9 1981); gpt, that confers resistance to iftyeopheh61ic : ^6id (Mulligan et al % 
19^81); neo; that con fefs resistance: to the aminoglycoside G-4 18 (Colberre^Garapin et al, 1981); 
and hygro: thai confers resistance to fiygromycih. 

15 It is cbntempl^ 

i.e: y expressed in iribre^d le^els^rela^Ve to their natural expression in normal human cells, or even 
relative to the expression of other p>roteihs in the recombinant host cell . Such overexpression may 
be assessed by a vaWet^ and/or protelh purifibatio^ However, . 

simple and dirtct metliods are prefeirr&d, for exam^ 

20 staining or We^tetH blotting, followed by Quantitative analyses, such ris densitoftietric scanning of 
the tesultaiit gel 6r blot; A specific: increase in the teverof ther^ 

fetyihi^^ io IhH -level ^^'^Ra^brdd a> Jn&xf^>cell& % 'ki^d^Ve1>f bv^jigpi^l^ >m a relative 
SSbtodaiice of the Specific proteiri in relation to the other proteirispf oduced by thehast cfcll and, e.g., 
visible on a gel. xjj]* :,- •. •: \.f^^iul -^^v^k^cr^ • ; 

25* 4 . r !: • - ■ " ' :: " ■ V X " / a... : . . •. • "~ .' ' . «..,••- ox ^ r " ... .■ . 

m ■ t txt , Fiirthef ^peels^f tfife present disclosii^ <^>nl5^ the putitotfofe^&aid in jjarticulaf 

protein'br-pefitMe ""-as tfsed herein, is intended to refe^ to a edmpositi6rij<lfs»lataMe from other 
30 y cdmpbnents, Wherein the protein or peptide is purified to any degree relative to its naturalry^ r 



> 
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obtainable state, i.e., in this case, relative to its purity within a cell extract. A purified protein or 
peptide therefore also refers to a protein or peptide, free ; from the environment in which it may 
naturally occur. 

Generally, "purified" refers to a protein or peptide composition which has been subjected to 
5 fractionation to remove various other components, and which composition substantially retains its 
expressed biological activity. Where the term "substantially purified'' is used, this refers to a 
composition in which the protein or peptide forms the major component of the composition, such 
as constituting about 50% or more of the proteins in the composition. 

Various methods for quantifying the degree of purification of the protein or peptide aie 
10 known iii the art. These include, for example, determining the specific activity of an active 
&actio,1 > ? x assessing the number of polypeptides within a fraction by SDS/PAGE analysis. A 
preferred method for assessing the purity of a fraction is to calculate the specific activity of the 
fraction, to compare it to the specific activity of the initial extract, and to thus calculate the degree 
of purity, assessed by a "-fold purification number". The actual units used to represent the amount 
15 of activity is dependent upon die particular assay technique chosen to follow the purification and 
whether or not the expressed protein or peptide exhibits an enzymatic or other activity. 

Various techniques suitable for use in protein purification are known in the art These 
include, for example, precipitation wi th ammonium sulfate, PEG, anti bodies and the like or by heat 
denaturation, followed by centrifugation; chromatography steps such as ion exchange, gel filtration, 
20 reverse phase, hydroxylapatite. and . affinity chromatography; isoelectric focusing; gel 
electrophoresis; and combinations of such and other techniques. As is generally known in the art, it 
%; is befieved that the order of conducting the various purification steps may be changed, or that 
;eertain steps may be, omitted, and still result in a suitable method for the preparation of a 
substantially purified protein or peptide. 
25 There is no general requirement that a protein or peptide always be provided ih its most, 

purified state. Indeed, it is contemplated that less substantially purified products have utility in 
^ in 

0/^ utilizing an HPLC 

3 9 ; ^PParatm generally results in a greater -fold purification than the same technique utilizing a low 



WO98/24035 PCTT/tjS^7/22105 

41 

pressure chromatography system. Methods exhibiting a lower degree of relative purification may 
have advantages in total recoveiy of pfotem 

protein. V 

It is known that tfte migration of a polypeptide may vary, sometimes significantly, with 
5 ^ Bidphys Resi Comm., 7(5:425,1977). 

:t is therefore appreciated that under differing electrophoresis conditions, the apparent molecular 
wei ghts of purified or partially purified expression products may vary. 

u ./.'4*.:\ «£• Antibody Generation ; T 

10 ■"' : --'- : h' : .'- - ' : . For some emlxxliniehts, it is preferred to produce antibodies that bind With high 

^specificity to the protein product(s) of an isolated nucleic acid of a sequence comprising Genebank 
" • A&^sioh mimbers 087451, T03 01 3, X03558, 1VI28T1 30, Y00787; SEQ tt> ttO:l •> SfiQ ID NO:2, 
' SKQ ID NO:3, SEQ ID ^0:4, SEQ ID NO:5, or SEQ ID NO:29. Means for preparing and 
bharacteri£ing antibodies are Weill krioWn in the art (See, e.g., Antibodies: A Labbratoiy Maniial, 
15 Cbld Springi&ai^ 1 988). > 

Methods for generating pblyclortal Antibodies' are well khowii in the art. Briefly, a 
polyclonal antibody is prepared by immunizing an animal with an immunogenic composition and 
collecting antisera fr^ A wide xange of animal species may be used for 

the production of aiitisera, including rabbits, mice, rats, hamsters, guinea pigs goats Because of 
20 the relatively large ' blood Volume of rabbits, a rabbit is a preferred choice for production of 
polyclonal antibodies. :< - 5 ' " : 

; As>is wdf ^tiWh in Hit aift, a gnferi composition may vaiy iri its itidmuftiDgetiiijity. It is 

often necessary therefore to boost the host immune system, as may ^ a 
^ peptide or pdlj^peptide immunogen to a carrier. Exemtplary ihd prdf^eSd^ carrieis are keyhole 
25^ ^ and ^Noviilie Seriki albumin (BSA). Other^ oviSlbumin, 

— ^ Me^ifoi^et^i^ting 
v | ^^y^itiae to a ^cairiex proteiii ak^%eIF?^^ M M i^ ^ 
m-mafeiiriLi^ carbodiiiindeahd 

^ 1 a ' As is also well ldio\Vri iri the art, 7 the nMlavmogeiiiOity 6^F a ^^dcular immunogen 
0 A ' -^mjfxMioM ^i^^^^ danced by the iiSe tif n^-spedffib ^^tilk(^^thfe ifitnuh JrtSpbnse, 
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. known as adjuvants. Exemplary and preferred adjuvants include complete Freund's adjuyant (a 
non-specific stimulator of the immune response containing killed Mycobacterium tuberculosis) y 
incomplete Freund's adjuvants and aluminum hydroxide adjuvant 

The amount of immunogen composition used in the production of polyclonal antibodies 
5 varies with the nature of the immunogen as well as the animal used for immunization. A variety of 
routes may be used to administer the immunogen (subcutaneous, intramuscular, intradermal, 
intravenous and intraperitoneal). The production of polyclonal antibodies may be monitored by 
sampling blood of the immunized animal at various points following immunization. A second, 
booster, injection may also be given. The process of boosting and titering is repeated until a 
10 suitable titer is achieved. When a desired level of immunogenicity is obtained, the immunized 
animal may be bled and the serum isolated and stored, and/or the animal may be used to generate 
monoclonal antibodies. For prpduction of rabbit polyclonal antibodies, the animal may be bled 
through an ear vein or alternatively by cardiac puncture. The removed blood is allowed to 
coagulate and then centrifuged to separate serum components from whole cells and blood clots. 
1 5 The serum may be used as is for various applications or else a particular antibody fraction may be 
purified by well-known methods, such as affinity chromatography using another antibody or a 
peptide bound to a solid matrix. 

Ivlonoclonal antibodies (MAbs) may be readily prepared through use of well-known 
techniques, such as those exemplified in U.S. Patent 4,196,265, incorporated! herein by reference. 
20 Typically, this technique involves immunizing a suitable animal with a selected immunogen 
composition, e.g., a purified or partially purified expressed protein, polypeptide or peptidey The 
immunizing composition is admiwstered in v a manner effective to stimulate antibody producing 
: cells^as described above.. , |W> 

v , Thei methods for generati^ along the same 

25 w : : imes as those for preparing polyclonal antibodies v J^ode^ts rats are preferred 

^ may 
^ ^ jmice are 



V :: : ;u^ ; ^e ^m^s are. injected with antigen as described above. The aaitigen may, be cotipled to 
30 ^ carrier molecules such as keyhole limpet hemocyanin i£necessary. The antigen isj^ically mixed 
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^ with adjuvant, such as Freund's complete or incomplete adjuvant. Booster injections with the same 

antigen typically occur at approximately tWo- week intervals. 

Following immunization, somatic cells with the potential for producing antibodies, 
specifically B lymphocytes (B cells), are selected for use in the MAb generating protocol. These 
5 * cells may be obtained from biopsied spleens, tonsils or lymph nodes, or from a peripheral blood 
^sample. Spleen cells and peripheral blood cells are preferred, the former because they are a rich 
: -= source of antibody-producingcells that are in the dividing plasmabl ast stage, arii^i the latter because 
peripheral blood is easily accessible. Often, a panel of animals are immunized and the spleen of the 
^toimal with the highest antibody titer is removeid and the spleen lyiriphocytes obtained by 
10 homogenizing the spleen with a syringe. Typically, a spleen from an immimized mouse contains . 

The antibody-producing B lymphocytes from the immunized animal are then fused with 
cells of an immortal myeloma cell, generally one 6f the same species as the animal that was 
immunized. Myeloma cell lines suited for use in hybndoma-producing fusion procedures 
15 preferably are non-antibody-producing, have high fusion efficiency, and enzyme deficiencies that 
render then incapable of growing in selective media which support the growth of only the desired 
fused cells (hybridomas). 

Any one of a number of m^ 
65-66, 1 986; Campbell, pp. 75-83, 1 984). For ex^ple, where the ittu^ 
20 one may use P3-X63/Ag8, X63-Ag8.6^3, NSl/l.Ag 4 1, ^^Ag:14, FO, NSG/U* MPCM 1, 

t1-R983F arid 4B21 0; and U-266, GM1500-GRG2, LICR-LON-H^ arid UC729-6 are all useful in 
connection with human cell fusions. ' 7 

One preferred murine myeloriia cell is the NS-1 myeloma cell line (also termed PS^NS-l- 
25 ' Ag4^i), which is readily available from the NIGMS Human Genetic Mutant Cell Reppatory by 

requesting cell line repository number GM3573. Another mouse my elom be : 

'^fe^edisth^ 

, • 5 * " - "* ? " Methods for generating hybrids of ^tib@^^^bducing^ l^>ldeh%r l^^K node sells and 
'myeloma cells "usually comprise mixing somatic cells with myeloma cetf Is in a 2*1 proportion, 
30^ though me proportion may vary from about 20: 1 to about 1 :1 , respectively, in thfe^presence of an 
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agent or agents (chemical or electrical) that promote the fusion of cell membranes. Fusion methods 
using Sendai virus have been described by Kohler and Milstein (1975; 1976), and those using 
polyethylene glycol (PEG), such as 37% (v/v) PEG, by Gefter ah (1977) The use of electrically 
induced fusion methods is also appropriate (Goding pp. 71-74, 1986). 
5 Fusion procedures usually produce viable hybrids at low frequencies, about 1 X 1CT 6 to 

1 X 10 . However, this does not pose a problem, as the viable, fused hybrids are differentiated 
from the parental, unfused cells (particularly the unfused myeloma cells that would normally 
continue to divide indefinitely) by culturing in a selective medium. The selective medium is 
generally one that contains an agent that blocks the de novo synthesis of nucleotides in the tissue 

10 culture media. Exemplary and preferred agents are aminopterin, methotrexate, and az^erine. 
Aminopterin and methotrexate block de novo synthesis of both purines and pyrimidines, whereas 
azaserine blocks only purine synthesis. Where aminopterin or methotrexate is used, the media is 
supplemented with hypoxanthine and thymidine as a source of nucleotides (HAT medium). Where 
azaserine is used, the media is supplemented with hypoxanthine. 

1 5 The preferred selection medium, is HAT. Only cells capable of operating nucleotide salvage 

pathways are able to survive in HAT medium. The myeloma cells are defective in key enzymes of 
the salvage pathway, e.g., hypoxanthine phosphoribosyl transferase (HPRT), and they cannot 
survive. The B cells may operate this pathway, but they have a limited life span in culture and 
generally #e within about two weeks. Therefore, the only cells that i^ay survive in the selective 

20 media are those hybrids formed from myeloma and B eells. 

This culturing provides a population of hybridomas from which specific hybridomas are 
selected,; Typically, selection c>f hybridomas is performed l>y culturing the cells by single-clone 
dilution in microtiter plates, followed by testing the individual clonal supernatants (after about two 
to weelcs) for the desired reactivity.; The assay should be sensitive, simple and rapid, such as 

25 radioimmunoassays, enzyme immunoassays, cytotoxicity assays, plaque assays, dot r ; 

.11^ Selected hybridonjias are thbn : serially diluted and cloned in^tp: ^individual 
nF^P^H^ then t>e propagated indefi^tely to provide MAbs. 

GeI1 lines may be q^ploi^ production in twa basic ways, A sample of the hybridoma 

\ ffi^X injected (oft<^ intp tijie peritoneal cavity) into a histocompatible animal of the tyjp^that was 
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useid to provide the somatic and myeloma cells for the original fusion. The injected animal 
develops tumors secreting the specific monoclonal antibody pfoduced by the fused cell hybrid. The 
body fluids of the animal, such as serum or ascites fluid, may then be tapped to provide MAbs in 
high concentration. The individual cell lines may also be cultured in viiro, where the MAbs are 
5 "naturally secreted into the culture medium from which they may be readily obtained iii s high 
^concentrations. MAbs produced by either means miay be further purified as needed, using filtration, 
centriftigation and various chromatographicmethods such as HPLC or affinity chromatography. 

Large amounts of the monoclonal antibodies of the present disclosure may also be obtained 
by multiplying hybridoma cells in vivo. Cell clones are injected into mammals which are 
10 histocompatible with the parent cells, e.g., syngeneic mice, tb cause girowth of ^tibody-producing 
tumors. Optionally, the animals are primed with a hydmfcairbbn, Especially oils such as pristane 

In accordance with the present ihveritidh, fragments of hiohoclorial antibodies may be. 
obtained by methods which include digestion 6f monoclonal antibodies with enzymes sucli as 
15 pepsin' or papain aiid/or cleavage of disulfidfe bonds by chemical reduction. Alternatively, : • 
mbnoclonSal antibody fragments encbiiiipasseci by the present disclosure may be synthesized using 
ah automated peptide synthesizer. 

The monoclonal conjugates of the present disclosure are prepared by methods known in the 
art, e.g.* by reacting a monoclonal antibody prepared as described above with, for instance, an 
20 enzyme in the presence of a coupling agent siich as glirtaraldehyde or periodate. Conjugates with 
fluorescein makers are prepared in the presence of %hese cotiplihg agents ot by reaction with an 
isothiocyanate. Conjugates with metal chelates are similarly : pr^ ■' ' 

^ -tables tiiiay be c^iyugat^d include l3l I ^P, 35 S, ^C, 51 Cr, 36 C1, 

- i57 Co, ^C6; 5 %e, ? ^Se; tf| -Eti, ari& 
25 *&htfo^ disclosiarfe ar6 jproduced 

• ^ecordiiigt^^eil-kttbwnto For ifi^kic^,m<)n6clbnal ahtiboctiesmay TOiddinated 

r " lby : . ebhtafct with sodium bit j^tassiuiri iiikiide Mfd a chemical oxidizing agent such ia£ sodium 
hypbchlbn^^ Monoclonal antibodies 

ateoriding ^6 ^ flie disclosure may be labeled with teeMfefitim^ by iigand exchange process, for 

3 0 " 
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Sephadex column and applying the antibody to this column or by direct labeling techniques, e.g., 
by incubating pertechnate, a reducing agent such as SNC1 2 , a buffer solution such as sodium- 
potassium phthalate solution, and the antibody. 

It will be appreciated that monoclonal or polyclonal antibodies specific for proteins that arc 
5 preferentially expressed in the peripheral blood of individuals withthe disease state have utilities in 
several types of applications. Thesis may include the production of diagnostic kits for use in 
detecting or diagnosing the disease state. It will be recognized that such uses are within the scope 
of the present invention. 

10 P. Immunodetection Assays 

1. Immunodetection Methods 

In still further embodiments, the present disclosure concerns immunodetection methods for 
binding, purifying, removing, quantifying or otherwise generally detecting biological components. 
The encoded proteins or peptides of the present disclosure may be employed to detect antibodies 
15 haying reactivity therewith, or, alternatively, antibodies prepared in accordance with the present 
invention, may be employed to detect Ae encod 

immunodetection methods have been described in the scientific litpratur^, such as, e.g., Nakamura 
, , et al. (1987)., ( 

v In general, the immimobinding methods m obtai^g a sample sxispectpd of containing 

20 ^^rot^in, peptide or antibody, and contacting the sample wffo^ ^tibody or protein or, peptide inn 
accordance with the present invention* as the case may be^ under , qonditiops eff^tive to alio w the 

v >vf^(t|ye component in a sample, w or quantitation of any 

25 J ^ the bidding .process, H^re^pn^ Qbtains a sample ??u^p«:ted of 

W^^^f*^ 1 ^^ : ^. ; i ^t^n**H&er encoded protein, ^ 

; Pontects the s^pl^ mth^ an antibody qr encoded r ^tem or peptide, ^ the caise ma^.be^ and then 

specific conditions- > • 

, . In terins of aiitigeB ddtQction, the biological sample analyzed would ordinarily consist of 

30 periphery bjood, Hovvreyer^ it may be any sample that ^ 
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specific antigen, such as a lymph node tissue section of specimen, a homogenized tissue extract, an 
isolated cell, a cell membrane prepaiatibn- separated or piirified forms of any of the above protein- 
Cont^hingcompositiorts. 6f ahy other biological fluid that comes into contact with diseased tissues, 
indhidiftg'fy^ ■'■■■/' 
5 v Contactihg the chosen biological sample %ith the protein, peptide or antibody under 

conditions Effective and for a period of time sufficient to allow the formation of immune complexes 
^pnmaiy iriihiune complexes) is generally a matter 6f siAiply adding the composition to the sample 
and ihcubatirig the mixture for a period of time ldng enough for the antibodies to form immune 
cdbiplbxes with, /. e. , to bind to, any antigens 7 presierii After this time, the sample-antibody 

10 coi^pdsittdni such as ri tissue section, ELIS A plate, dot blot or Western blot; is generally washed to-.- 
remove any non-specifically bound antibody species, allowing only those antibodies specifically 
bound withiii the primary immune complexes to be detected; 

In general, the detection of immunocomplex formation is well known in the art and may be \ 
deffieve^ through the application of numerous approaches. These methods are generally based 

15 upbn the detection of a label or marker, &ueh ak any radioactive, fluorescent, biological br- 
enzymatic tags or labels of conventional use in the art. U.S. Patents concerning the use of such 
labels ihclude 3,817,837; 3,850,752; 3^9^50; 3,996,345; 4^77,437; 4^75,149 and 4,366,241, 
each incoipdratedftereiii by referenced Of cotijr^e, bhe may find additional iadVantage^ through the 
: sbfcond aiitibo^ bf^ a 'bibtin/aviditi ligand binding 

20 arrangement- as is known in the art* 

"The encoded protein, peptide or cbriesj^bnding antibody employed irii the detection may 

Alti^SH^ely, the first added cornpoiierit that beebmies bbiiilld Within the pi^aiy itnmxme 
25 bbni^ the' 
^^cod^^fe^*^ 

< ^W^^M to a detSbt^ble l^bfeL llhe may 
th^ firiinary immufi^^ a*e cont?teted with the 

r iid^ed^ secbhd^ 
30 sufficient to allow the formation of ^^d^^fi^ 
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complexes are then generally washed to remove any non-specifically bound labeled secondary 
antibodies or ligands, and the remaining label in the secondary immune complexes is then detected 
Further methods include the detection of primary immune complexes by a two step 
approach. A second binding ligand, such as an antibody, that has binding affinity for the encoded 
5 protein, peptide or corresponding antibody is used tp form secondary immune complexes, as 
described above. After washing, the secondary immune complexes are contacted with a third 
; ; binding ligand or antibody that has bindmgafmiity for me second antibody /agam un 
5 r effective and for a period of time sufficient to allow the formation of immune complexes (tertiary 
immune complexes). The third ligand or. antibody is linked to a detectable label, allowing detection 
10 of the tertiary immune complexes thus formed. This system may provide for signal amplification if 
this is desired. 

The immunodetection methods of the present disclosure have evident utility in the 
diagnosis of human disease states. A biological or clinical sample suspected of containing either 
the encoded protein or peptide or corresponding antibody is used. However, these embodiments 
15 also have applications to non-clinical samples, such as in the titering of antigen or antibody 
samples, in the selection of hybridomas, and the like. 

In me clinical diagnosis or monitoring of patients with a disease state, the detection of an 
antigen encoded by a disease state marker nucleic acid, or an increase in the levels of such ah 
antigen* in comparison to the levels in a cprrespondmgbiplogical sample from a normal subject is 
20 indicative of a patient with the disease state. The basis for such diagnostic methods lies, in part* 
withitiie finding that the nucleic acid disease state markers identified m, the present disclosure are 
r. dverexpressed in peripheral blood samples frpm individuals with the disease state (see Examples 1 
through 4 below). £y extension, it may be inferred fliat at least some of these markers produce 
• - elev^^ leyels; pfcncoded proteins, that n^aj^be nsedas. diabase ffitemaijkciy • 
25, Methods of differentiating between significant expression of a biomatker, which represents 

^ * or background expression of a biomarker are well known in 

vj* 5 ^ ^ above which increased 



,! high 
^^¥t by a high proportion of cells 

30 ,iir from witWn a tissue mat each give a positive signal. 
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2. tmmunohistochemis&y 

The antibodies of the present disclosure may also be used in conjunction with both 
fresh-frozen and formalin-fixed, paraffin-embedded tissue blocks prepared from .study by 
5 immunohistochcmistry (IHC) or fixed cells on microscope slides for immunocytochemistry. Th^ '- 
method of preparing tissue blocks from these particulate specimens has been successfully used in 
previous IHC studies of various prognostic factors and is well known to those of skill in the art 
(Brown e/ a/., 1990; Abbondanzo et al y 1 990; Alfred et dl , 1 990). 

: P± Briefly, frozen-sectionsmay be prepared by rehydrating 50 ng of frozen "pulverized" tissue 
10 it roorii temperature in phosphate buffered saline (TBS) in sm^ll plastic capsules; pelleting the 
particles by centrifugation; resuspending them in a viscous embedding medium (OCT); inverting 
the capsule arid pellfetihg again by centrifugaitiori; stiap^freezihg m -70 P G isop£ntane; cutting the 
plastic capsule and removing the frozen cylinder of tissuie; securing the tissue cylinder on a cryostat 
microtome chuck; and cutting 25-50 serial sections containing ari average of about 500 iiitact cells. 
15 " Permanent-sections may be prepared by a similar m^ 

mg sztmple in a plastic" microfuge tube; pelleting; resuspending in 10% formalin for 4 hours 
fixation; wasbing/pelleting; resuspending m agar; pelleting; c^ in ice water to 

harden the agar; removing the tissue/ag^ biock fr^ infiltrating and embedding the block 

in paraffin; and cutting up to 50 serial penrianeht sections. - 

: . 3 v ; MowCyiometry "■ ■ " H ! ; - ■ » : "V-*:;- - 

: ' ^ Expre&ed pifoteiiis riiay also b& detected by fldw cytometry as described in 
#uj^hima et al, 1 996. In the practice of the method, the cells are fixed and then incubated with a 
monbclorial MilxKly agaiiist the expressed protein to d&emed; %e :bound ^tibodies are then 
25 ^ AHg^MUJ^ 

be measured by ftow cytoMc^ Oftho diagnostics, or 

" ^ light 

30 * cbfl Size afid ih& right aligle light scatter td cell d^hsi% cdD contour ^d nucleo^tytoplasmic ratioi 
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Since cells are tagged with fluorescent labeled antibody they can then be further characterized by 
fluorescence intensity and positive and negative windows set on the FACS to collect bright 
fluorescence and low fluorescence cells. Cells are sorted at a flow rate of about 3000 cells per 
second and collected in postii\ c and negative cells. 

5, ' 

, 4. o - ELISA 

As noted, it is contemplated that the encoded proteins or peptides of the disclosure 
have utility as immunogens. e.g. r ? in connection with vaccine development, in 
immunohistochemistry and in ELISA assays. One evident utility of the encoded antigens and 
10 corresponding antibodies is in immunoassays for the detection of disease state marker proteins, as 
needed in diagnosis and prognostic monitoring. 

Immunoassays, in their most simple and direct sense, are binding assays. Certain preferred 
immunoassays are the various types of enzyme linked immunosorbent assays (ELIS As) and 
radioimmunoassays (RIA) known in the art. Immunohistochemical detection using tissue sections 
15 is also particularly useful. However, it is readily appreciated that detection is not limited to such 
techniques, and Western blotting, dot blotting, FACS analyses, and the like may also be used. 

In one exemplary ELISA, antibodies binding to the encoded proteins of the disclosure aifc 
immobilized onto a selected surface exhibiting protein affinity, such as a well in a polystyrene 
microtiter plate. Then, a test composition suspected of containing the disease state marker antigen, 
20 such as a clinical sample, is added to the wells. After binding and washing to remove non- 
specifically bound immunecomplexes, the bound antigen may be detected. Detection is generally 
^ a^hietved by the addition of a second antibody specific for the: target prptein, that is linked to a 
" ^^te J^ls ^ a simple ' -sandwich ELISA". Detection may also be 

. ^ has 
25 ; ^ ? \ 
In ^©fo^ 

antigen are immobilized onto the well surface and then contect^ ^th the. antR^dBes; of the 
invention, ^iter binding anci was^g to i^ 
> ik ^ Lin 4 ^ig^R is det^ted, - q \Vhe|^ f ti^ mitid ^tibodies are, linked to a d^t^tabfe label, the 
30^;:-; imn^^ directly. Again, the immunecomplexes may bedetepted using v 
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a second antibody that has binding affinity for the first antibody, with the second antibody being 
linked to a detectable label. 

Another ELISA in which the proteins br peptides are immobilized^ involves the Use of 
antibody competition in the detection. In this ELISA, labeled antibbdies are added to the wells^ 
5 allowed to bind to the disease state marker protein, and detected by means of their label. The 
amount of marker anti gen in ah unknown sample is then determined by mixing the sample with the 
labelecTantihiHj^ The presence of marker antigen 

in the sample acts to reduce the amount of antibody available for binding to the well and thus 
deduces the ultimate signal. This is appropriate for detecting antibodies in an unknown sample, 

10 *where thc iitilabelcd antibodies bind to the Antigen-coated wells ahd reduces the amotiiit of antigen 
available tb bind the labeled antibodies; 

Irrespective of the format employed, ELLS As have certain features in common, such as 
coating, incubating or binding, washing to remove no bound species^ and detecting 

the bound immdhccomplexcs: These are described as follows: 

15 In coating a plate With either antigen or antibody, it is typical to incubate the Wells of the 

plate with ia solution of the afrtigeh or antibody, either oyeriiight or for a specified period of hours. 
The wells of the plate are then washed to reiiibVe incompletely adsorbed rhatbriali Any remaining 
available sulfates of the 1 wells are then "coated" with a nonspecific protein that is 'antigenically 
neutral with regard td the test antiseral These include bovine serum albumin (BSA), casein and 

20 solutions of milk powder. The coating allows fo^ the 
f " v - lihmbbiHzihg surface and thus reduces the Mckgrourid G&usecf by nonspecific binding of antisera 

i ^ '* Itt ^LiSA^ it is more: cb^^^ a 
f di^6t pitoatirc. TTiuS, dftef binding b^^ 
25 reactive material to reduce &a£k^^ the - V:. 

'immobilizing surface is contacted with the control and/or clinical or biological sample to be tested 
under conditions effective to allow immtme^ Detection of the 

• ^^n^tmecb^ labeled stetfdhdary biiidihfe ligsWd ot antibody, or a secondary 



i. 



10 
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"Under conditions effective to allow immunecomplex (antigen/antibody) formation" means 
that the conditions preferably include diluting the antigens and antibodies with solutions such as 
BSA, bovine gamma globulin (BGG) and phosphate buffered saline (PBS)/Tween. These added 
agents also tend to assist in the reduction of nonspecific background. 

The "suitable" conditions also mean that the incubation is at a temperature and fora period 
of time sufficient to allow effective binding. Incubation steps are typically from about 1 to 2 to 
4 hours, at temperatures preferably on the order of 25° to 27°C, or may be overnight at about 4°C 
or so. 

Following all incubation steps in an ELISA, the contacted surface is washed so as to 
remove non-complexed material. A preferred washing procedure includes washing with a solution 
such as PBS/Tween, or borate buffer. Following the formation of specific immunecomplexes 
between the test sample and the originally bound material, and subsequent washing, the occurrence 
of even minute amounts of immunecomplexes may be determined. 

To provide a detecting means, the second or third antibody has an associated label to allow 
detection. Preferably, this is an enzyme that generates color development upon incubating with an 
appropriate chromogenic substrate. Thus, for example, one may contact and incubate the first or 
second inununecomplex with a urease, glucose oxidase, alkaline phosphatase or hydrogen 
peroxidase-conjugated antibody for a period of time and under conditions that favor the 
development of further immunecornplex formation (e.g., incubation for 2 hours at room 
20 temperaturem a PBS-cpntainingsolutionsuch as PBS-Tween), 

After incubation with the labeled antibody, and subsequent to washing to remove unbound 
material, the amounts label is quantified, e.g., by incubation with a chromogenicsubstrat^such as 
^^m^^m^^Mf 3 ^^ 2>2;-azidc>^.(3^tiiyi) 1 W [ABTS] and 

H 2 Q2, in the case of peroxidase as the enzyme label. Quantitation is then achieved by measuring 
25 the degree of color generation, using a spectrophotometer, 

» of Antibodies for Radioimaging f , , 

^ i • ? .-^-^^iQ^ofilmiMsi^i^^^vs^to quantify and localize the expression of 

fte encqdedmarkerproteins. The antibody, for example, may of 
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methods arid used to visualize the localised concentration Of the: cells producing the eficoded 
protein. 

A radionuclide may be bound to an antibody either directly or indirectly by using an 
intermediary functional group. Intermediary functional groups which are often used to bind 
5 radioisotopes which eiist as metallic ions to antibody rire diethylenetriaminepentaacetic acid 
(DTPA) and ethylene diaminetetriaceticacid (EDTA). Examples of metallic ions suitable for use in 
' this disclosure are ^%; < 3, I ^RuT^Cu, ^Ga, ,2 *I, 68 Ga, n A*;* 9 Zr, and 201 T1. 

In accordance: with' this disclosure, the monoclonal antibody or fragment thereof may be 
labeled by any of several techniques known to the ait. The methods of the present disclosure may 
1 0 alsoiise paramagnetic isotopes for purposes of in tf/va detection. Elements particularly useful in 
Mapietic ( W MRI") include !57 Gd, 55 Mh, I62 Dy > ^Cr, arid 56 Fe. 

Administration of the labeled ^tibody thay be local Or systemic and accomplished 
intravenously, intraarterial, via the spinal fluid or the like. Administration may also be 
intradermal or intracavitary, depending upon the body sit^ under examination. After a sufficient 
15 time has lapsed for the monoclonal antibody or fragment thereof to bind with the diseased tissue, 
for example 30 minutes to 48 hours, the afea of the subject xinder investigation is examined by 
routine imaging tecl&itjties such as MRI, $P£dT, plafcar scintillation imaging and emerging 
imaging technique^, as well The exact prOtocol riecessMiily varies depending upon factors specific 
• f o the patier^ examination, method of 

20 administration and type of label usedl The determination of specific procedures is routine in the art. 
The distribution of the boimdi' radioactive isotope and its increase or decrease with 'time is then 
monitored and recorded. By comparing tfife ife^iiits with data obtained from studies Of Clinically 
' ' ^nlrh^ extertt of the diseased tissue may be detenhined. 

1 n ' thTms^ ftf dta$e staite cells by their fcffeOt on gene 

25 expression in immune system lymphocytes. In early stages of the disease state* such! immune 
m * re^risemay ^ localized: Fbr eikmplfi, the response may be limited to lymph riOdes immediately 
f of 
differentially expressedlli§e&& Matte ina&erfc tnay be Of titility for Separ&titog disease states of 
^ J W a detection 
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means is thereforeof significance in the management and care of patients with the disease state. It 
will be recognized that this utility is included within the scope of the present disclosure. 

6. Kits 

In still further embodiments, the present disclosure concerns immunodetection kits 
for use with the immunodetection methods described above. As the encoded proteins or peptides 
may be employed to detect antibodies and the corresponding antibodies may be employed to detect 
encoded proteins or peptides, either or both of such components may be provided in the kit The 
immumtdctcciion kits thus comprise, in suitable container means, an encoded protein or peptide, or 
a first antibod> that binds to an encoded protein or peptide, and an immunodetectionreagent 

In certain embodiments, the encoded protein or peptide, or the first antibody that binds to 
the encoded protein or peptide, may be bound to a solid support, such as a column matrix or well of 
amicrotitcrplate. 

The immunodetection reagents of the kit may take any one of a variety of forms, including 
those detectable labels that are associated with or linked to the given antibody or antigen, and 
detectable labels that are associated with or attached to a secondary binding jigand. Exemplary 
secondary Iigands arc those secondary antibodiestiiat have binding affinity for the first antibody or 
antigen, and secondary antibodies that have binding affinity for a human antibody 

Further suitable immunodetection reagents for use in the present kits include the two- 
component reagent that comprises a secondary antibody that has binding affinity for the first 
antibody or antigen, along with a third antibody that has binding affinity for the second antibody, 
. the tWrd antibody be^ 

The kits may further comprise a suitably aliquoted composition of the encoded protein or 
polypeptide antigen, whether labeledor unlabeled;, as may be used to prepare a standard curve for a 
detection assay. 

l%e kits may contain antibody-label conjugates either in, fully conjugated form, in the form 
interinediates, or as separate moieties te^be conjugated by the user of the, kit. The components of 
the kits npray be papkaged either in aqueous media or in lypphilizedform. . \ 

Tht container means of the kits gener^ly inpludes at least^ne. yial, test tube, flask, bottle, 
syringe or other container means, into which the antibody or antigen may be placed, and preferably, 
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suitably ahquoted. Where a second or third binding ligand or additional component is provided, 
7 the kit also generally contains a second, third or other additional container into which this ligand or 

Coinpoheht may be placed. The kits of the present disclosure also typically include a means for 
Containing the antibody, Antigen, and any other reagent containers in close confinement for 
5 commercial sale. Such containers may include injection or blow-molded plastic containers into 
which the desired vials are retained. 

E. Detection and Quantitation of RNA Species 

6rie embodiment of the initant disclosure comprises a method for identificationof a disease 

10" State in a biological sample by amplifying arid detecting nucleic acids corresponding to disease 
£tate markers. The biological sample may fee any tissue or fluid in which lymphocyte Cells ihight 
be present. Various embodiments include bone marrdw kspiirate, bone martow biopsy, lymph node 
aspirate, lymph node biopsy, spleen tissue, fine needle aspirate, skin biopsy or orgaii tissue biopsy. 
Other embodiments include samples of body fluid such as peripheral blobd, lymph fluid, ascites, 

1 5 serous fluid, pleural effeion, ^utuni, cerebrospinal fluid, lacrimal fluid, stool or urine. 

Nucleic acid used as a template for amplification is isolated from cells contained in the 
biological sample, according to conventional methodologies. (Sambfook et.al y 1989) The nucleic 
acid may be genomic DNA or fractionated or whole cell RNA. Where RNA is used, it may be 
desired to convert the RNA to a complementary cDNA. In one embodiment, the RNA is whole cell 

20 RNA arid is used directly as ^ 

Pairs of primers that selectively hybridize to nucleic acids corresponding to disease stated 
sp^ific markers are contacted with the isolated hUdeib acid ^ conditions that permit selective 
•hybridizzitiori. Once hybridized, thfe nucleic acidtprimer coniplex is contacted witlv one or more 
enzymes tHat facilitate template^dependent nucleic acid synthesis; Multiple rotmds of 

25 amplification, also referred to as "cycles^" are conducted Until a sufficient amount of aniplification 

Next, the amplification product is detected. In certain ap^i£^o^fe be 
"perform^ by Visual means. A3tehiaifiVeIy, tHe detection may MVolVe itSdirec^ 
product via chemili^ineSte^ or fluorescent 
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label or even via a system using electrical or thermal impulse signals (Affymax technology; Bellus, 
1994). 

Following detection, one may compare the results seen in a given patient with statistically 
significantreference groups of normal individuals and patients with the, disease state. In this way, it 
is possible to correlate the amount of marker detected with various clinical states. 

L Primers 

The term primer, as defined herein, is meant to encompass any nucleic acid that is 
capable of priming the synthesis of a nascent nucleic acid in a template-dependent process. 
Typically, primers are oligonucleotides from ten to twenty base pairs in length, but longer 
sequences may be employed. Primers may be provided in double-strandedpr single-stranded form, 
although the single-stranded form is preferred. 

2. Template Dependent A mplification Methods 

A number of template dependent processes are available to amplify the marker 

sequences present in a given template sample. One of the best known amplification methods is the 

polymerase chain reaction (referred to as PCR) which is described in detail in U.S. Patent Nos. 

4,683,195,4,683,202 and 4,800,159, and in Innis erai, 1990, each of which is incorporated herein 
by reference in its entirety. 

Briefly, in PCR, two primer sequences are prepared which are complementary to regions on 
opposite complementary strands of the marker sequence. An excess of deoxynucleoside 
triphosphates is added to a reaction mixture along with a DNA polymerase, e;g. r Td? polymerase. 
If the mark er sequence is present in a sample, the primers bind to the marker and the polymerase 
causes the primers to be extended along die marker sequence by adding on nucleotides. By raising 
and lowering the temperature of the reaction mixture, the extended primers dissociate from the 
marker to form reaction products, excess primers bind to the marker and to the .reaction products 
and theprocess is reputed, \ .J 

, ; A averse transcp^^ be performed in order to quantify 

the amount of mRNA amplified. Methods of reverse transcribing RNA into cDNA arewell known 
and described in Sambrook et al, 1989. Alternative methods for reverse transcription utilize 
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thermostable DNA polymerases. These methods ;are <ie£cnbedm WO 90/07641 filed December 
21,1 990;. Pbiyhaerase chain reaction methodologies are Well known in the art 

Alternatively, Ktf A species can be quarititated by means thait do not necessarily require 
amplification by PCR. These means may include other amplification techniques, for example, 
5 isothermic arii^lificati^h techniques Such as tlie one developed l>y Gen-Prdbfe (San Diegov CA), and 
the ligase chain reaction ("LCR"), disblosed in EP A Na. 320 308* incorporated herein by reference 
in its entirely. In LCR, two complementary probe pairs are prepared, and in the presence of the 
target sequence^ each pair binds to opposite complement^ st^ds of the target such that they 
ibtit. In the presence of a ligase, tlie two prdbei plairs link to form a single unit By temperature 
10 x^clirig, as in PCR, boun<3 ligated units dissociate frbm the target and- then serve as "target 
"sequences" for ligation of excess probe pairs/ U.S Patent 4,883,750 describes a method similar to 
LCR for binding probe pairs to a target sequence. 

^betafe^piicase, described in PCT kppKcatiofi N6: PCI7US87/008^0, mdy also be used as 
slill another amplification the pre^fefit invention- In this method, a replicati ve sequence 

15 of RNA which has a region compfemeritary to that of a target is added to a sample in the presence 
of an RNA polymerase. The polymerase bdpies the replieati then be 

detected. * '' ryr: / : : ' "\ ; " ■ • 

An isotheimal ^plificaaori riiethod, in which reltri ctidh endonucleases and ligases are 
used to achieve the am^lifieatibh of target molecules that contain nucleoside 5'^alpha-thio]- 
20 triphosphates in one strand of a restriction site may also be %sefiil in the amplification of nucleic 
h adidr& ^ pres^ Mention Wsflfcer et al. 9 Ptoc. im f AM: Sci;WS& 8fc392^9& (3992), 

' ^fhe6^ir4fedhereui by i^ferenc^ in itb entirety. , s-v^r 

H #f strand Bi^lacement Ainp^ df casing but isothermal 

' rin^lMsati^ involves multiple rbUndis of Strand displaced 

25 ^ ^ 

:; ^^k^tptobds throughout a regibii targeted for a^piSGc^ion^ follow^ by a repair reaction in 
which only two of the four bases are present. The other two bases may be addeft as bi^toylated 
d^V^es for easy detrctfcn, A <mtti^^ 
v • alsbibede^^^ In CPR, a prbbe^fiaving 3 f mAS sequences of 

30 non-^cifici&NA and a middle sequent present 
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in a sample. Upon hybridization, the reaction is treated with RNase H, and the products of the 
probe identified as distinctive products which are released after digestion. The original template is 
annealed to another cycling probe and the reaction is repeated. 

Other amplification methods are described in GB Application No. 2 202 328, and in PCT 
ApplicationNo. PCT/US89/0I 025,each of which is incorporated herein by reference in its entirety, 
may be used in accordance with the present invention. In the former apphcation, "modified" 
primers are used in a PCR like, template and enzyme dependent synthesis. The primers may be 
modified by labeling with a capture moiety <e^, biotin) and/or a detector moiety (e,g., enzyme). In 
the latter application, an excess of labeled probes are added to a sample. In the presence of the 
target sequence, the probe binds and is cleaved catalytically. After cleavage, the target' sequence is 
released intact to be bound by excess probe. Cleavage of the labeled probe signals the presence of 
the target sequence. 

Other nucleic acid amplification procedures include transcription-based 
systems (TAS), including nucleic acid sequence based amplification (NASBA) and 3SR. Kwoh 
et al., Proc. NartAca& Sci.U&i 86:1 173 (1989); Gingeras et al, PCT Application WO 88/10315, 
incorporated herein by reference in their entirety. In NASBA, the nucleic acids may be prepared 
for amplification by conventional phenol/chloroform extraction, heat denaturation of a clinical 
sample, treatment with lysis buffer and minispin columns for isolation of. DNA and RNA or 
guanidinium chloride extraction of RNA. These amplification techriiques involve annealing a 
primer which has target specific sequences. Following polymerization, DNA/RNA hybrids are 
f digested with RNase H while double stranded DNA molecules are heat denatured again. In either 
case the single stranded DNA is made fully double stranded by addition of second target specific 
primer, followed by polymerization. The double-stranded DNA molecules are then multiply 
transcribed by a polymerase such as T7 orSP6. In an isofiiermal cyclic ^^ reactiqn, theJRNA's are 
reverse transcribed into double stranded DNA, and transcribed once against with a polymerase such 
as T7 or SP6. The resultmg produe^, whether truncated or complete, indicate target specific 
■ - v.. - . h sequencesL y~~i % w - ; ' , .. r .,- i - ? ■■....= •, 

• - ' •" ^vey e tal <> EPA No; 329 822 (incorporated herein by reference in its entirely) disclose a 
v nucleic acid amplification process involving cyclically synthesizing smgle^stranded RNA 
30« < (C which may be used m accpnlance with ^ 
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' the present invention. The ssRNA is a first template for a first primer oligonucleotide, Which is 
elongated by reverse transcriptase (RNA-dependent DNA polymerase). The RNA is then removed 
froin the resulting DNA :R N A duplex by the action of nbonucldaseH (RNase H, an RNase specific 
for RNA in duplex with cither DMA or RNA). The resultiant ssDNA is a second template for a : 
5 second primer, which ^ 

T7 -RNA polymerase) 5' to its homology to the template. This primer is then extended by DNA 
polymerase (exemplified by the large "Klenow" fragment of 2?. coli DNA polymerase I), resulting 
in a double-stranded DNA CdsDN A") molecule, having a sequ^riceideintical to that of the original 
;> RNA betwfeen the primers and Ha ving additionally, ait one fend, a promoter sequence. This promoter 

10 seqtierice may fee used by the appropriate RNA pbiyiiierase to make many RNA copies of the 
DNA. These copies may then re-ehtcf the cycle leading to very swift amplification. With proper 
chbice of ei^ isOtBerinally without addition of enzymes at 

each cycle. Because of the cyclical nature of this process, the starting sequence may be chbsen to 
be in the form of feither ^ : ^^ ? r 

15 Miller er a/., PCT £j>pU^ by refererice in its 

eiifii^ty) ^isclbse % ™ the hybridization of a 

promoteiv^rimer sequel 

inahy RNA eopies of%ef Seqiience. This scheiniiefe riotcyclic, te , hew templates are not produced 
froin the resultant RNA transcripts. Other amplification inethods include "race" and -'one-sided 
20 VCk? Froiim^i, MA, W: VCR PkOTOCOlS: 'A ; GUIDE TO METM O&S AND 
APPU&iTiONS, Atadcnkt Trek, N.Yv (1990) tod«irt, Pr&C, tfat 't Acad -Set USA, 

^ ? Methbds based on ligation 6f two (c^r more) dligonueleofideSin the presenee^f iititfeic acid 
h^ing life Sequehee of the resulting "di^ligdhiiele^ide^ c thereby anipfyittg the di- 
25 &igbhucle<^*te, may Mb be Med in the amplMtation Stej> tff the preseftt inveiitioii. ^ et al 9 

' ' Anex^^bFate^S^ 

u^toiju^yS^ There ^e-many d^erent 

J 1 Versions of fcubtease protection assays RrtoWi to those Jiifidticed'iii the art The characteriistic that 
30 all versions of nuclease protection assays share in common is that they involve hybridization of an 
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aritisense nucleic acid with the RNA to be quantified. The resulting hybrid double stranded 
molecule is then digested with a nuclease that digests single stranded nucleic acids more efficiently 
than double stranded molecules. The amount of antisense nucleic acid that survives digestion is a 
measure of the amount of the target RNA species to be quantified. An example of a nuclease 
protection assay that is commercially available is the RNase protection assay manufactured by 
Ambion, Inc. (Austin, TX). 

3. Separation Methods 
? Following amplification, it may be desirable to separate the amplification product 

from the template and the excess primer for the purpose of determining whether specific 
amplification has occurred. In one embodiment, amplification products are separated by agarose, 
agarose-acrylamide or polyacrylamide gel electrophoresis using conventional methods. See 
Sambrooke/a/., 1989, 

Alternatively, chromatographic techniques may be employed to effect separation. There are 
15 many lands of chromatography wKich may be used in the present invention: adsorption, partition, 
ion-exchange and molecular sieve, HPLC, and many specialized techniques for using them 
including column, paper, tliin-layer and chromatography (Freifelder, 1982), 

Another example of a separation methodology i§ done by coyalentiy .labeling the 
oligonucleotide primers used in a PGR reaction with various types of small molecule ligands. In 
one such separation, a different ligand is present on each oligonucleotide. A molecule, perhaps 
an antibody or avidin if the ligand is biotin, that specifically fcinds to one of the ligands is used to 
coat the surface of a plate such as a 96 well ELISA plate. Upon application of the PCR reactions 
to the surface of such a prepared plate, the PCR products are bound with specificity to the 
surface.. After washing the. plate to remove unbound reagents, a solution containing a second 
25 molecule that binds to the first ligand is added. This second molecule is linked to some kind of 
reporter system. The second . molecule only binds to the plate if a PCR product v has been 
pro^uce^swhereby both oligonucleotide primers, are, incorporated into the final PCR products. 
eiy^;^^^y^%.^bR PCR product is , then detected .and quantified in a commercial plate reader 
inveh as ELISA reactions; are detected and quantified. An ELISA-like system such , as the one 
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described here has been developed by the Raggio Italgene company under the C-Track trade 
name. 

4 Identification Methods 
5 V: Aniplificatioh products must be visualized in order to cohiSim' amplification of the 

marker sequences. One typieal visualization method involves staining of a gel with ethidium 
bromide and visualization under UV light. Alternatively, if the amplification products are 
integrally labeled with radio- or fluotometricklly^ the amplification products 

ftiay then be expoisied to x-ray film or visualized under the appropriate stimulating spectra, 
10 following separation. ' 

In one embodiment, visualization is achieved indirectly. Following separation of 
amplification products, a labeled, nucleic afcid probe is brought into cohtiact with the amplified 
marker sequence! The probe preferably i£ Conj iigiated to a chromdphore but may be radiolabeled. 
In another embodiment, the probe is conjugated to^ a binding partner^ such as an antibody or biotin, 
15 where the b^rmeitit^rbfthe 

In one embddimeht, detection is by Southern blotting and hybridization with a labeled 
probe. The techniques involved in Southern blottirig are well known to those of skill in the art and 
may be found in many standard books on molecular protocols. SeeSambrook et til, 1989; Briefly, 
amplification products are separated by gel electrophoresis. The gel is then contacted witli a 
20 membrane, such as hitiw nucleic acid and noii-covalent binding. 

Subsequently, the membrane is Iriciibated witka chromophore^conjugated^robe that is capable of 
L H f %ybridm«g\vith a t^get amplification prod^ Detection is by exposure of the membrane to x-ray 
film or ito , ^ 

• ' : > One %Xam^le of the foregoing fe described in U S ^atent Mo. 5,279,721, incorporated by 
25 reference heireiti, which discloses ah apparatus and method for the automated electrophoresis and 
^transfer of nucleic acids. The app^iratus^pennits electrophoresis and blotting without external 
manipulation of the gel and is ideally suited to carrying out methods acebhiihg to the present 
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5. Kit Components 

All the essential r^aterials and reagents required for detecting disease state markers in a 
biological sample may be assembled together in a kit This generally comprises preselected; 
5 primers for specific markers. Also included may be enzymes suitable for amplifying nucleic acids 
including various polymerases (RT, Taq* etc.), deoxynucleotides and buffers to provide the 
. necessary reaction mixture for amplification. 

Such kits generally compnse, ia suitable means, distinct containers? for each individual 
reagent and enzyme as well as; for each marker primer pair. Preferred pairs of primers for 
10 amplifying nucleic acids are selected to amplify the sequences specified in Genebank Accession 
numbers P87451 V T03013, X03558, M28130, Y00787, SEQ ID NO:l, SEQ ID NO:2, SEQ ID 
NQ:3, SEQ ID NO :4, SEQ ID NO: 5, or SEQ ID NO:29. 

In another embodiment, such kits comprise hybridization probes specific for disease state 
markers, chosen from a group including nucleic acids corresponding to the sequences specified in 
15 Genebank Accession numbers D87451, T030I3, X03558, M28130, Y00787, SEQ ID NO:l, SEQ 
ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:29. Such kits generally 
comprise, in suitable means, distinct containers for each individual reagent and enzyme as well as 
for each marker hybridization probe. ?v 

20 F* Use of RNA Fingerprinting to Identify Markers of Human Disease 

RNA fmgerprintHig is a means by which RN As isolated from many dW^nt tissues, cell 
i/-: : be sampled ^ultan^i^ly, to identify RN As whpse relative 

abundances vary. Two forms of this technology were developed simultaneously and reported in 
1992 as RNA fingeiprintmgb^^ 1992). 
25 (See also Liang and P^ee, y.S; Patent 5^62^31 1,^ 

Some of the studies described herein were performed similarly to Donahue e/ or/., J. Biol Chent 
. 269^604^609^ 1994,^^^^^ ^ : V v . 

All forms of RNA fingerprinting by PCR are theoretically similar but differ in ih&x primer 
design and application. The most striking difference between differential display and other 
30 methods of RNA fingerprinting is that differential display utilizes anchoring primers that hybridize 
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to the poly A tails of mRNAs. As a consequenbe, the PCR products amplified in differential 
display are biased towards the 3' untranslated regions of mRNAs. 

The basic technique of differential display has been described in detail (Liang and Pardee, 
1992). Total 'cell RNA is primed for first strand reverse inscription with an anchoring primer 
5 -composed of bligo dT. The oligo dT primer is extended using a reverse trianscriptase, for example, 
"Moloney Murine Leukemia Virus (MMLV) Reverse tratascri^tiase: The synthesis of the second 
strand is primed with art arbitrarily chosen oligonucleotide, using reduced stringency conditions. 
Once the double-stranded cDNA has beeri synthesized/^ proceeds by conventional 

^GR : tccKntqW^r'uiiiMng the same primers The resulting DNA fingerprint is analyzed by gel 
10 Electrophoresis iin^ clhidium bromide Staining or autoradiography. A side by side comparison of 
^fin%etprfnus obtained from different bell derived RN As using the same 1 oligonucleotide primers 
identlfiersmRNAs that are differentially expressed. - ' 

RNA fingerprinting technology has been demonstrated as being effective in identifying 
genes than are di ft erent tally expressed in cancer cells (Liaing eir ^A, 1992; Wong et at.-; 1993; Sager 
15 etaL, 1993; Mok e/ a/., 1994; WMsdhe^/., 1994; Cheri:^/ 1995; An et a/ , 1995). The present 
disclosure utilizes the RNA fingerprinting technique or other techniques described herein to 
identify gdries that are differential ly expressed in peripheral blood tells in fiiimah disease states* 

G. ^ 

20 Reverse trariscription (RT) of RNA to cDNA follo\V6d by relative quantitative PGR (RT- 

PCR) may be used to detbnninfc the relative ebneientratioris of specific niRN A%»eci^ ih a »ries of 
total cell RNAs isolated from peripheral blood of normal individuals Smd ^ 
^Sttfte. fey (fetehtfiiiing that the concentration of a specific ml^A ^efeies varies it is^hown that the 

25 ■ •' 1 to fcbnfiiW th^t ittftNA tr^ shown to fee diff^htiSlly regulated by RNA fiiigedrprinting are 
^dil^tMaiy^essed irt dise&e st^e p^gi^oh. ^ • • - ^ 

~w PCfc; the' number of nioigeules of the amplified target DN A irifc^^ 

y/ ^ two Witli ^Very fcycle of the teactioh ttntil sonie regent beeoriies 

the rate of amplification becomes increasingly diMt^hMHjhtil thcjrfe is^iibt ^ afi u ^feKs^^iii the 
30 amplified target between cycles. If one plots a graph on which the cycle number is on the X axis 
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and the log of the concentration of the amplified target DNA is on the Y axis, one observes that a 
curved line of characteristic shape is formed by connecting the plotted points. Beginning with the 
first cycle, the slope of the line is positive and constant. This is said to be the linear portion of the 
curve. After some reagent becomes limiting, the slope of the line begins to decrease and eventually 
becomes zero. At this point the concentration of the amplified target DNA becomes asymptotic to 
some fixed value. This is said to b^ me plateau portion of the curve. 

The concentration of the target DNA in the linear portion of the PCR is directly 
proportional to the starting concentration of the target before the PCR was begun. By determining 
the concentration of the PCR products of the target DNA in PCR reactions that have completed the 
same number of cycles and are in their linear ranges, it is possible to determine the relative 
concentrations of the specific target sequence in the original DNA mixture. If the DNA mixtures 
are cDNAs synthesized from RNAs isolated from different tissues or cells, the relative abundances 
Of the specific mRNA from which the target sequence was derived may be determined for the 
respective tissues or cells. This direct proportionality between the concentration of the PCR 
15 products and the relative mRNA abundances is only true in the linear range portion of the PCR , 
reaction. 

The filial concentration of the target DNA in the plateau portion of the curve is determined 
by the availability of reagents in the reaction mix and is independent of the original concentration 
of target DNA; Therefore, the first condifion that must be met before the relative abundances of a 
20 mRNA species may be determined by RT-PCR for a collection of RNA populations is that the 
concentrations of the amplified PCR products must be sampled when the PCR reactions art; in the 

The secondcondition that must be met for an RT-PCR study to successfully determine the 
relative abundances of a particidar rnRN A species is that relative c»n(^ntrations of me ampjifiable 
■■ «DNAs must, be normalized to some independent standard. The goal qf an RT-PCR study is to 
determine the abundance of a particular mRNA species relative to, me^yerag^^ of all 

mRNA species in the s^ple, In the studies described beloWj mRNAs fqr B-acfin, asparagine 
^synAetase ^ were used as external and mtema| standards to which die relative 

abundance of othermRNAs are compared. 
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iviost protocols for competitive PGR utilize internal PGR standards that are approximately 
as abundant as the target. These strategies are effective if the products of the PGR amplifications 
are sampled during their linear phases. If the products are sampled when the reactions are 
approaching the plateau phase, then the less abundant product becomes relatively over represented 
Comparisons of relative abundances made for many tiiffdferit'RNA s^ples, such as is the case 
v&en examining RNA samples for differential expression, befcdrite distorted in such a way as to 
make differences in relative abundances of RNAs appear less than th^y actually are. This is not a 
sighifidaht problem if the internal standard is 'much mofe abundant than the target. If thfc internal 
standard is more abundant than the target, then direct linear comparisons may be made between 
RNA^ampies: 

- - Thd ^iscu&ibn above describes the theoretical ^ohsidei^tions for Im RT-PCR assay for 
clinically derived materials. The problems inherent in clinical samples are that they afe of variable 
quantity (making norhiali^tiOn problematicX arid that they are of ^variable quality (necessitating the 
co-ampiificatibn of a reliable internal control;, preferably of larger Size than the target). Both of 
these problems are overcoiiie if the RT-PCR is perfbrmed as a relative quantitative RT^PGR with 
an irttemal standard in which the internal sfe&d^ i^ fragment that is larger 

; tfiaii the target cDNA fragmeht and iri Wtiidh thfe abundance of the mRN A encoding the internal 
standard is roughly 5-100 fold higher than this AiRNA encoding the target; This assay measures 
relative abundance, not absolute abundance of the respective MRN A spedes. - 

Other studifes m^y be performed usiilg a more coiiventioflal relative qtitotitatiVe RT-PCR 
\v^^^6i^ PCR >roduc^ itt the lihe^ jSoiftion of 

their amplification curves. The number of PGR cycles that are optimal for sampling must be 

' Impm^ for bach target cDN&;fo transcriptase 

^r^ducts df RNk popiilatlbri isfcftto 

ncimkliied for mal ctf^^ the 
%iear r&ng£ of tife' amplifirato pttparatioitS sire tedious and 

derived from a relative quantitative RT-PCR with an internal standard. 

%e cohwi^ii^a siiigle PGR prdd^ct ih the ltt 
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sensitivity of the assay. Another reason is that with only one PCR product, display of the product 
on an electrophoretic gel or some other display method becomes less complex, has less background 
and is easier to interpret. 

5 FL Diagnosis and Prognosis of Human Cancer 

In certain embodiments, the present disclosure enables the diagnosis and prognosis of 
. human cancer by screening for marker nucleic acids. Various markers have been proposed to be 
j correlated with metastasis and malignancy. They may be classified generally as cytologic, protein 
. or-nucleicacid markers. 

10 Cytologic markers include such things as "nuclear rouridedness" (Diamond et al. y 19$2) and 

cell ploidy. Protein markers include prostate specific antigen (PSA) and CA125. Nucleic acid 
markers have included amplification of Her2Aiew, point mutations in the p53 or rgs genes, and 
changes in the sizes of triplet repeat segments of particular chromosomes. 
AU of these markers e 

15 negatives. A false positive result occurs when an individual without malignant cancer exhibits the 
presence of a "cancer marker". For example* elevated serum PSA has been associated with prostate 
carcinoma. However, it also occurs in some individuals with non-malignant, benign hyperplasiaof 
: the prostate. A false negative result occurs when an individual actually has cancer, but the test fails 
to show the presence of a specific marker. The incidence of false negatives varies for each marker, 
20 and frequently also by tissue type., For example, ras point mutations have been reported to range 
from .a high of 95 percent in pancreatic cancer to a low of zero percent; in some, gynecologic 
..cancers. _ , ,.r, v •-.,•«-.. 

a AddiUonal problems arise when a marker is present only within the transformed cell itself. 

25 t for example-, the serum or urine of individuals with ros-actiyated carcinomas. .This means that, m-V: 
order to detect a maligra^ must take a sample of^^ tum|or. itself, or its metastatic cells. 

- ; fiiEssen^aUy ^ 



. may be detected. ,^::;%v.;VA^ 
- 5 ^ FwaUyi specific problems o^ that are present in normal cells but absent in 

30 : cancer cells. Most tumor samples contain rnixed populations of both normal and transformed cells. 
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If one is searching for a marker that is present in normal cells, but occurs at reduced levels in 
transformed cells, the "background" signal from the normal cells iii the sample may mask the 
presence of transformed cells. 

The ideal disease state marker Would be one that is present in individuals with the disease 
5 state, and either missing or expressed at significantly lower levels in normal individuals. The 
present disclosure addresses this need, in the casfe of metastatic prostate cancer for example, by 
identifying several new nucleic acid markers that are expressed at higher levels in individuals with 
metastatic prostate cancer than in normal individuals. In particular, the results for markers UC302 
(SEQ ID #3) and UC325 (SEQ ID #4) are quite promising in that these markers are apparently only 

10 overexpreSsed in the peripheral blood of individuals with metastatic tumors and ire present at 
relatively low levels in norm 

Further, since the markers are present iri the whole blood of individuals with the disease 
state, the present detection method avoids the problem of having to suspect a ttimor is in place 
before it may be sampled. The instant disclosure has utility as a general screening tool for 

15 asymptomatic individuals, as well as a means of difterentially diagnosing those patients whose 
tumors have already metastasized. Depending upon the type of tumor involved, such individuals 
may be selected for systemic forms of ^ti-caneer therapy rather than surgical removal of localized 
tuiiior masses Certain individuals with advanced forms of highly maJignant metastatic tumors may 
bie optimally treated by pairi management alone. 

20 It is anticipated that in clinical applications, human tissue sample^will be sdreented for the 

presence of the disease state markers identified herein. Such samples would rioimally colisist of 
peripheral blood, but may also consist of needle biopsy Cores or lymph node tissue. Iri certain 
embodiments, nucleic acids Would be extracted frbm Iheisc samples and ^plified as d&cribed 
abdveV Some embodiments would utilize kits containing pre-selCcted primer pairs or hybridization 

25 ' prdfees. The amplified nucleic acids would be tested for thef markers by, for example, gel 
clectrophorasis ^d ei&idium bromide staining, or Southern Wotting, or a solid-piiase detection 
^ifieari^^d^ 

x ' iii^kei^ fleti&ted Would be Compared with stati^cally vMid groups individuals witfrmetastatic, 
^ri^metastail3 or benign tumdrs or normal individuals. The diagnosis arid prognosis of 

30 the individual patient would be determined % compari^rt <r 
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Another embodiment of the present disclosure involves application of RT-PCR techniques 
t 9 detect a disease state using probes and primers selected from sequences comprising Genebank 
Accession numbers D87451, T03013, X03558, M28130, Y00787, SEQ ID NOrl, SEQ ID NO*2 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:29. Similar techniques have been' 
described in PCT Patent Application No. WO 94/1 0343, incorporated herein by reference. 

In this embodiment, the disease state is detected in hematopoietic samples by amplification 
of disease state-specific nucleic acid sequences. Samples taken from blood or lymph nodes are 
treated as described below to purify total cell RNA. The isolated RNA is reverse transcribed using 
a reverse transcriptase and primers selected to bind under high stringency conditions to a nucleic 
acid sequence from a group comprising Genebank Accession numbers D8745 1 , T030I3, X03558, 
M28130, Y00787, SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NOA, SEQ ID NO:5, or 
SEQ ID NQ:29. Following reverse transcription, the resulting cDNAs are amplified using 
conventional PCR techniques and a thermostable DNA polymerase. 

The presence of amplification products corresponding to disease state-marker nucleic acids 
may be detected by several alternative means. In one embodiment, the amplification product may 
be detected by gel electrophoresis and emidium bromide staining. Alternatively, following the gel 
electrophoresis step the amplification product may be detected by conventional Southern blotting 
techniques, using an hybridization probe selected to bind specifically to a disease state-marker 
nucleic acid sequence. Probe hybridization may in turn be detected by a conventional labeling ; 
means, for example, by incorporation of [ 32 P]-nucIeotides followed by autoradiography. The 
,^pIificationp^ £ - 

thpse utilizing a disease state-marker specific hybridization probe and an appropriate labeling 
means, or ever* the ELIS A-Iike system known as C-track™ as described above. ! 1 . 
^ r ^ following examples are included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those ofskill in the art that the tecinuques disclosed in the ; 
example^ which follo^. represent teclunques discovered by the inventors to function weil in the 
practice of m^nvention, and m^ 

How^tho^^f^fi many 
changes ina^y be made Hke or 

similarresult without deparfi^frbm&e spirit and sco^ 
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L Materials and Methods • 

/. Application of RNA fingerprinting to discover biomarkersfor disease strifes : } > 

RNA fingerprinting (accordihg to Liang arid Pardee, 1 992; Welsh ei aL; 1 992; 
5 Liang and Pardee, 1993) was applied to nucleic acids isolated from the peripheral blood of 
individuals with metastatic prostate cancer, compared with normal individuals. 

Blood was draSvii irbm cancer patients arid normal individuals into Vacutainer CPT tubes 
''-'''^fti'ficbl gradients (Becton Dickinson and Company, Fraxikih The tubes were 

" ceritrifuged to separate the red blood cells froiri vairious tyjpes of nucleated cells, collectively 
10 " r^rred to as the btiffy cbat^ artd from blood pl^Sriik Total cell RNA was isolated^from the buffy [ 
' ^■■^66kts-l>y-thte RNA ST^T-60toe^^ isolation, the 

nucleic acids were precipitated 

redissolved in water. The redissolved nucleic acids were then digested with^I^ I 
(Boehringer Mannheim, Inc:) following the manufacturer's instructions, followed by organic 
1 5 extraction with phenolxhlotofoi^ etharioL 

The DNase 1 treated RNA was thfeh pelleted by cehtrifugation and redissolved in water. 
The purity arid cohceiifratibnof the RNA in solution was estimated by determining optical density 
at wave lfeiigths of 260 then examined by 

efectro]^ The 
20 RNA Was then divided intb three Miquots. Oiie aliquot was set aside for relative quantitative RT- * 
n PCR corifirihatiti^ ^ - 

< A second aliquot was iiised to fihgerpririt the RNA by converting the RNA to first strand 
" ^ cDNA by PGR using 

^ 9^ the resulting PCR 

25 amplified products on an agarose g£l 
^ap^eani^ 
r , The RNA ^f ^ t^ 

i iridividu^^ of patients *vitb metastatic prostate The 

" pools ^Were' flngei^iiilted using: th& r s&pential ^a^vi^ &ethcKi^^^^^ :Mbitrai^y primod. PCR 
30 - fingerprinting of RNA (McClelland et a/., 1994, J\foc/eic *4c^ 
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incorporated herein by reference) with several changes. For example, arbitrary oligonucleotides of 
15 to 24 nucleotides were used with Taq polymerase, and one tenth of each first strand cDNA 
reaction in each arbitrarily primed PCR reaction. One hundred and 200 ng were used in each first 
strand cDNA synthesis, respectively. Certain genes disclosed herein were discovered by the 
5 . sequential pairwise method. 

2- Methods Utilized in the RNA FingerprintingTechniqtte 
^ The second type of RNA fingerprinting studies performed more closely resembled 

j the protocol of Welsh et al (1992). This approach used a variation of the above as modified by the 
10 use of agarose gels and non-isotopic detection of bands by ethidium bromide staining (An et al, 
1995). Total RNAswere isolated from peripheral blood samples as described (Chomczynski & 
■ Sacchi, 1987). Ten micrograms of total cellular RNAs were treated with 5 units ofRNAse-free 
DNAse I (G1BCO/BRL) in 20 mM Tris-HCl (pH 8.4), 50 mM KC1.2 mM MgCl 2 ; and 20 units of 
RNAse inhibitor (Boehringer Mannheim), After extraction with phenol/chloroform and ethanol 
15 precipitation, me RNAs were redissolvedin DEPC-treated water. 

Two pg of each total cell RNA sample was reverse transcribed into cDNA using randomly 
selected hexamer primers and MMLV reverse transcriptase (GIBCO/BRL). pCR was performed, 
using one or two arbitrarily chosen oligonucleotideprimers (10-12mers). PCR. conditionswere: 10 
mM Tris-HCl (pH 8.3), 50 mM KC1, 1.5 mMMgCfe, 50 mM dNTPs, 0.2 mM of primers), 1 unit 
20 of Taq DNA polymerase (GIBCO/BRL) in a final volume of 20 nil. The amplification parameters 
included 35 cycles of reaction with 30 sec denaturing at 94°e> 90 sec annealing at 40°C, and 60 sec 
extension at 72°C. A final extension at 72^C w#s performed for 15 min. The resid PCR 
^ productswere resolved into a fingerprint by size separation by elecurOphor^thrpugJi 2% agarose 
gels in TBE buffer (Sambrook et al, 1989)^ The fingerorints were visualized by staining with 
25 ./ etbidium bromide. No re-amplification was perform !? .... ......... 

EHfTerentially appearing PCR products, lhat might represent difyerentiaUyexpressed genes, 
; .W^jmsa^ fromthegelwimai^rbMe,pu^^ kit 
(Bio lOil^ meO^lmed m wa 
strategy (Invitrogen, Inc., and Promega, Inc.). 
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3. XZonfWrhdtion of Differential Expression by ^Relative Quantitative RT-PCR: 
Protocols for RT-PCR 

a: Reverse transcription 
5 0rietofiV6jigo 

ihto cDNA Reverse transcription performed with 400 units 'b^lJii^y^itBWite transcriptase 
; (GlBtO/BRL) in the presence of 50 r^ Tri^HGl (pH 8^ mM ( 

DTT, 500 mM dNTP, 50 ng random hexamers per microgram of RNA, and 1 U/ml RNase 
inhib'itbr. The reaction voluirie was 60 ml. The reaction mixture was incubated at room 
10 temperature for 10 minutes, then at 37°C for 50 minutes. Afler reverse transcription the enzyme 
was denatured by heating to 65°C for 1 0 minutes. After heat denaturation the sariiples were diluted 
with water to a final vblurrie I of 300 ml. i 

RT : PCR was utilized to examine iriFtNAs for differential expression; The sequences of 
oligonucleotides used as primers to direct the amplification of the various cDNA fragments are 
15 presentee! in Table 3. 

b. Relaihti&a^ 

The concentrations of the original ttftal cell I^NAs were determined by me^urement 
of OD 26 (V28o (Sambrook et at! 9 1989) and confirmed by examination of ribosbmal RNAs on 
20 ethidium bromide stained agarose gels. It is required that all quantitative rl*CR reactions be ■ v; 
normalized for equal amounts of amplifiable cDNA after the reverse transcription is completed. 
One solution to this is to termtnkte tfedfcticfaS by ^^^th^^^R feacfiohs into plateau phase. 
This approach was utilized in some studies because it is quick and Efficient. Lipocortin II was used 
^theto 

^Marihheiiii incliidi^ i.5 3 M dillited <&>NA, 1 arid ^15 uMts of Tdq DNA 

pol^merase/1 Otfriil of reaction volume. ^ 
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Cycling parameters: 30 cycles of 94°C for 1 min; 55°C for 1 min; and 72°C for two min. 
Themiocyclers were either the M J research thermocy cler or the Stratagene kobocycler. 

a Relative Quantitative RT-PCR with an External Standard 
There are three problems with the relative quantitative RT-PCR strategy described 
above. Firsts the internal standard must be roughly 4-10 times more abundant than the target for 
this strategy to normalize the. samples. Second, because most of the PCR products are templated 
from the more abundant internal standard, the ass^y is less than optimally sensitive. Third, the 
internal standard must be truly unvarying. The result is that while the strategy described above is 
fast, convenient and applicable to samples of varying quality, it lacks sensitivity to modest changes 
in abundances. 

To address these issues, a normalization was performed using the 6-actinmRNA as external 
standard. These PCR reactions were performed with sufficient cycles to observe the products in the 
linear range of their amplification curves. The intensities of the ethidium bromide stained bands 
were documented and quantified using the Is 1 000 imaging analysis system manufactured by the 
Alpha Innotech, Corp. The quantified data was then normalized for variations in the starting 
concentrations of ampiifiable cDNA by comparing the quantified data from each study with that 
derived from a similar study which amplified a cDNA fragment copied from the B-actin mRNA. 
Quantified dlata that had been normalized to beta actin were converted into bar graph 
presentations. 

, , 4, » MultivariateAnalysi? of Prostate Disease State ^ .„. ^ 

i\ : ; a* Specimen Collection N :: .. .... 

Blood specimens (8-1,0 mis) were collected by venipuncture into standard serum 
or serum-separating tubes (Becton-Dickinson), allowed to coagulate for 30 minutes at room, 
temperature, and then centrifuged at low speed (100(bc ^g) for 10 minutes. Some spepimens 
coming were immediately fro^ pd ? shipped overnight i>y cleliveiy courier. Otiiers ware 
collected, processed, frozen, and shipped on dry ice by ; overnight mail. Upon arrival, all 
specimens were stored at -20°C. Repeated freeze-thaw cycles were avoided. 
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b. Measurement of Free and Total PSA 

Two commercially available assays were utilized to measure PSA concentrations, 
an IMMULITE solid-phase chemiliiminescence-based assay to measure free PSA (Diagnostic 
Products Corp.; Los Angeles, CA), and the FDA approved assay from TPOSOH (San Diego, CA) 
5 that utilizes an enzyme-conjugated monoclonal antibody and fluorescent substrate to measure , 
tOtaTPSA. However* since two different instruments were utilized to measure the components of 
the f/t PSA ratio, the international reference standards for free arid total PSA were utilized to 
calibrate both assays and calculate the "coirected'^f/t PSA ratio (Stam 

;\ 1 ■ W;> " • • • • ' - . 

10 c. f/t PSA Reference Standards arid Correction of fit P^A ratio } 

The corrected f/t PSA ratio was determined according to Marley et dt 9 1996. 
Reference standards for free and total PSA assays were purchased from the Stanford University 
Prostate Center and consisted of an equimolar mixture of 90% PSA-ct-l-antichymotrypsin and 
10% free-PSA (Stamey, 1995; Chen et al y 1995). All testing dilutions were performed with 1% 
15 bovine serum albumin (Fraction V; Sigma -Chemical Co:) in 20 mM phosphate-buffered saline 
(PBS), pH 7.4. Expected concentrations of the reference standards, determined from molar 
extinction coejfficients (e) y were also provided. 

r: ^ ree : ttoftal PSA ^says. were standardised as follows: fease the mean of seven 

linear standard curve runs of the reference standards (Stanley, 1995), correlation factors for free 
20 : v and total;- PSA m^i^tnent were ^Calculated / Slope (m) deviations Were njetoured relative to the > 
linfearplpt^ Since 
all curves passed through the origin, the correction" factor fbi &^t0t^ PSA ratio was 
calculated from the difference in slopes; lntra*assiay coefficimts of variation for free PSA (range 
- ^ 0-2.0iig/ml) and total'PSA ^ rig/nil) as^ys wei^ 7% antf ^ The 

" i$ v •■■ ■' confebtiori fi^i^ free Srid total PSA vikiubs^re 1,19 ^ 0:S3; respectively. For > 

7 ? x Iht (T0SOH) tot4 PSA assay teamed equally ' to fife free i&tf bound (PSA^&E) forms 
bf^PSA. The (tyimulite) &ee PSA assay sy^eift was Unable to detect the bound fraction^of PSA 
h - ■ j total and free PSA 

30^ wfere^ 
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Differences in free and ,„, a , senm pSA 
were exa.nined ,ne s^nssT k ^ B ™ 

(Volhner, T,^^,^ * Wneoxon ^ ^ 

Corpora-on, ^S^^^^^^ software package (S«a 

—ys as wel, as ». , ogis<ic regrcssjon _ detemUne «» «*— I cutoff for tte 

TV co^ ^ JZr, " * Steinberg a„ d ^ 

STATA™ ^^. ^ mdePen *'» ~» <™re a,sp da*™™, ^ £ 
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A commercial tt 5? • 

so,„ OM To pr ^™^ ydi, "- Rn '-^op so.nnonand 

**>• , 25 , «. 5 , 31 . 2 , ,5.,,;, ^ * * ■» ^ cneen^on,: 1000 _ 

IL-8 eon,„ga te . The wells were covered W.K * ^P^e standard) and 100 pi of 

b»«er,&ra wal Wsfa After «he final w, «» grated and washed wift wash 

^^Onnna,^^! 0 ^^ — «" »- * -X -U and 
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and mixed by gentle tapping. Optical density was measured within 30 mih of addition of stop 
solution, using a Bio-Tek EL-808 microplate reader (96 well format) at 450 nm. 

/ IL -8 Standard Curve and Coefficient of Variation (CV) 
5 < " The IL-8 standard curve consisted of eight concentrations: 1000, 500; 250, 125, 

62.5, 31.2, 15.6, 7.8 pg lL-8/ml. The mean of six different measurements of each standard 
dilution v\;in plotted <x-axis) vs. the mean optical density measured <y-axis). Results were 
plotted usinj! the KC3 software package (Bio-Tek instruments; Winboski, VT). 
" CViclilctcni of variation (CV): From the eight data points for each concentration of the 
10 stahtidrd curve. CoelTicient of Variation (CV) - Standard Deviation/Mean was calculated to be 
6.9, 6.4, i 1 . 1 . ibll ;*4.5, 4.4, 13.0 and 34.1%, respectively for the standard curve concentrations, 
listed above. Points with a CV of greater than 13% were not Utilized for this study. ' 

• k: " Examples 

15 Example J 

"Relative Oitantitdiiye Reverse 

A method to evaluate novel genes (ESTs) as diagnostic biomarkers- 

The reverse transcription^polymerase chain reaction (RT-PCR) protocols described in the 
lOh/'iV'-^^VviAg examples were developed as a means to detennine the relative abundances of mRNA 

species that are expressed in various tissues, organs and cells, This prptoc^ 
■ . 'X ; ; as applied prostate tissue in US Application SbriatNoi :0^6 relevant part 

hei^m by reference, -The protocols used to meet this need mxjst be robust, reproducible relatively 

of resources; rapid and hay^ a high ttepii^ipwt rate, 
25 Relative quantitative RT-PCR has the technical fe^ in theory, meet all of th^ criteria, 

dn praeHce there -are six important barriers to implementing an RTtP£R based 3£S|y that, 
^; compaires the* relative abundances of imRNA speTcies., The protocol described herein addresses 
v ^ H ea6^pf %^e six barriers and has; permitted Ae r^i^tio^o£,lhe j^tent^ this 
• ^ 4 ^j(>pK(4ationr Although the present ex?uripje is drawjv to the identification and confinnation of 
30h < T differeritial expression in various physiological states in prostate toe, the method^ described 
herein may be appl ied to any type of tissue, and particularly to peripheral bipod cells to provide a 
sensitive method of identifying differential expression. 
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The inventors have described the examination of candidate genes by this method that 
were partial cDNA fragments identified by RNA fingerprinting methodologies. This necessitated 
development of a relatively quantitative approach to independently confirm the differential 
expression of the mRNAs from which these partial cDNA fragments were derived. The key 
objective of the described screening protocol is the assessment of changes in the relative 
abundances of mRNA. 

The gene discovery program previously described is focused on analysis of human tissue 
and confirmation must be performed on the same biological material. Access to human tissue for 
isolation of RNA is limited. This limitation is especially problematic in Northern blots, the 
traditional means to determine differential gene expression. Northern blots typically consume 
roughly 20 ug of RNA per examined tissue per gene identified. This means that for the average 
size of tissue sample available, only 1-5 Northern blots can be performed before all of the RNA 
from a tissue sample is completely consumed. Clearly Northern blots are seriously limited for 
primary confirmation of discovered genes and consume extremely valuable biological resources 
1 5 required for gene discovery and characterization. 

Because of such limitations on the amount of available tissue, and because of the need for 
high throughput and rapid turnaround of results, a two tiered assay protocol was developed that 
is technologically grounded on reverse transcription (RT) of RNA into cDNA followed by 
amplification of specific cDNA sequences by polymerase chain reaction (PGR). This coupling of 
20 techniques is frequently referred to as RTrPCR. 

One advantage of RT.PCR is that it consumes relatively small quantities of RNA. With 
20jtg of RNA per examined sample, the amount of RNA required to perform a single . Normem 
y blbt experiment, 50-200 RTRPCR assays may be performed with up to four data pdints per assay. 
Another advantage is a high throughput, eight independent experiments wWcte examine eight 
different mRNA species for differential expression may be perfonned simmt^epu^ly in a single 
PGR machine with 96 wells. A single individual Skilled iri this technique may thereby examiiie 
and 1 evaluate eight genes per tfay without sigmficant time constraints. By comparispto, even if 
RNA 6f sufficient quality a 
sirnffarl^ skille individual performing Northern blots would be hard, pressed to exahflne and ^ 
evaluate eight genes per week. In addition to the lower throughput mte of Northern blots, eight 
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Northern blots per week would require the consumption 6f about 400jiCi of P per week. While 
riot dangerous to use in the hands of a skilled individual, 32 P is certainly inconvenient to use. RT- 
PCR avoids the use of radioactive material 

An addition^ advantage of RT-PCR Over Northern bfots as a technological platform for 
5 evaluating the relative expression of mRNA species is that RT-PCR is much less sensitive to 
differences in quality of the RNA being examined. The human tissues described were removed 
from patients for treatment purposes and were only incidentally saved for further studies. "Hence 
the RNA, an extremely labile molecule, is expected to be at least partially degraded; Because the 
RNA is separated bynsize 6n a gel in the Northern blot assay, partially degraded RNA appears as 
10 a smear, rather than discrete bands. By contrast, RT-PGR amplifies only a section or domain of 
an RNA molecule, and as long as that portion is intact, the size or degradation state of the entire 
mbletule is irrelevant. As a result, RNAs that are identical except that they vary by degree of 
partial degradation will give much more variable signals iri a Northern blot thari they will in an 
RT-PCR. When samples are of variable quality, as is often the case in human studies, the relative 
1 5 sensitivities of the techniques to variation in sample quality is'ah important consideration. 

In the practice of this method, total cell RNA is first converted into ct>NA using reverse 
transcriptase prirried with random hexamers. This protocol results in a cDNA population in 
which eack RNA has contributed according to its relative proportion in original total cell RNA. 
If two RNA species differ by ten fold in their original relative abundances in the total cell RNA, 
20 then the cDNA derived from these two RNAs will differ by ten fold ih their relative 
abundances in the resulting population of cDNA. This is a conservation of relative 
proportionality in the conversion of RNA to cDNA. 

Aiiother consideration is the relative rates of amplification of a targeted cDNA by PCR. 
" Xt < <• iti • theol^ ihe amotirit of ^ri ai^ified product synfliesized ^ PCR will be eif&al to M(E C ), 
25 Wherein the mass 6f the target^! cDfe/riibfebules btee '^b^g^M^g-tfp^GR and C is the 
' ! ^ttKltob^^r PCR cycl k ^rfbrriied; E is an efficietidy of aniplifidatiori factor This factor is 
%cAni>le>r varies between 1 and 2. The imptfitMit c&nsMerati On in this asi&y is that over most 
bifa PCk^pMcation, Ewill nearly coi&tant afcd nearly equaHo 2. In PGR Actions that are 
idehtieaf ih evefy W&y except the cDNAs beingu$ed as templates are derived frbAi different total 
30 cell RNAs, then E will have the same value in each reaction. If a cDNA target has an initial mass 
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of M, in one PCR reaction and a mass of M 2 in another PCR reaction and if E has the same value 
in each reaction, then after C cycles of PCR there will be a mass of M,(E C ) of the amplified 
target in the first reaction and a mass of M 2 (E C ) of the amplified target in the second reaction. 
The ratios of these masses is unaltered by PGR amplification. That is Ml/M2= [M,(E b )]/M 2 (E c >. 
5 Hence, there is a conservation of relative proportionality of amplified products during PCR. 

Since both reverse transcription and PCR may be performed in such a way as tp conserve 
^ proportionality, it is possible to compare the relative abundance of an mRNA species in two or 
more total ceil RNA. populations by first converting the RNA to cDNA and then amplifying a 
fragment of the cDNA derived from the specific mRNA by PCR. The ratio of the amplified 
10 masses of the targeted cDNA is very close to or identical to the ratios of the inRNAs in the 
original total cell RNA populations. 

Six major challenges or barriers to be overcome in order to best use RT-PCR tp quantify 
the relati ve abundances of RNA are as follows: 

15 1) Degradation of RNA must be minimized during RNA preparation.. 

2) Genomic DNA must be eliminated. 

3) RNA must be free of contaminants that might interfere with reverse transcription. 

4) The efficiency ;of ? RT is variable. cDNAis, not RNA, must be normalized for equal 
concentrations of amplifiable cDNA. i;:= 

20 5) Lirnited linear i^ge require multiple^ samphng points in any amplification curve. 
6) Tube to tube variability in PCR .,,./.' 

1- A/It is;the development Qf techniques to overcome these ; barriers and Jo provide a sensitive 
and accurate method of quantitative that is applicable to any tissue type, or c^ll type 

25 s such as peripheral bipod cells^ or p]^ysiolqgicaLsta:te that is a part of the present invention. 

;pr-.f*- TTie fi^ three ^arrier^ successful RT-PCR are all related to the qualij^ ? of ^ RNA 

address the ,.^r^^t^;barpers' as;. 
:ri:;k^e$fmh^ iix the last se^iv These- ar& ^ ^quirem^nts that deg^atjU% of: M££ inust be 
^; nrinhrii^ apd that genomic DNA m 
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Two preferred methods for RNA isolation are the guanidinium thiocyanate riifcthod, 
which is well known in the art, and kits for RNA isolation manufactured by Qiagen, Inc. 
(Chatwofth, C A), with the kits being the most preferred for convenience. Four protocols are 
performed On the RNA isolated by either method (Or any method) biefofe the RNA is be used in 
RT-PCR. 

The first of these four protocols is digestion of the RNAs with Diiasel to remove all 
genomic DNA that was co-isolated with the total cell RNA. Prior td DNasel digestion, the RNA 
is in a particulate suspension in 70% ethanol. Approximately 50 jag Of RNA (as determined by 
OD 2 60728o) is removed from the susjpehsion and precipitate*!, this RNA is resuspended in DEPC 
treated sterile wateir. to this Is added 10X E)j4asel buffer (200 nfitf tris-HCI; pH 8.4, 20 mM 
MgCl 2 , 5^0b mM KC1)V 1 0 units of RNase Inhibit^ Cat#155 1 8-0 12) and 20 units 

of DNasel (GIBCO-BRL # 18068-015). the volume is ^dj^ted to 50 fil with additional DEPC 
treated water, the reaction is incubated at 37*C for 30 minutes. After t)NaseI digestion the 
RNAs are organic solvent-extracted with phenol and chloroforrri followed by ethanol 
precipitation, this reprteerits the secOhd ethanol precipitation of the isolated RNA. Empirical 
observations suggest that this repeated prebipitatiori improves RNA performance in the RT 
reaction to follow. 

Followiftg IJNasel digestion, an aliquot of the RNA suspensioh iti fethariOl is remdvfed and 
divided into thirds. A different procedure is performed on 6k6h Orikbf the aliquot thirds, these 
three procedures are: (1). Ah OD 2 6o/280 is obtained using a standard jfrotoebl and is used to 
estimate the amount of RNA present and its likely quality. (2). An aliquot is run out On an 
agarose gel, and the RNA is stained with ethidium bromide. Observation that both the 28S and 
18S RNAs are visible as discreet bands and that there; is little Staining above tlie point' at Which 
the 28S rRNA migrates indicate that the RNA is relative^ iiit^ 

performance that the examined RNAs be completely free dt^aitfal dej^da^iii, it fcito^ to 
determine that the RNA is not so degraded as to significantly eifect die appearance I of the 28S 
rRNA. (3). The total cell RftAs are rim using a ^CR^basfelj ^4hat fe&ifii^ that tbe 4 DNaseI 
treatment actually digested the contaminating genomic DNA to completion. It f is very important 
to confirm complete digestion of genomic DNA because genomic DNA may act as a t&nplate in 
PCR reactions resulting in false positive signals in the relative qu^fitetiVe* 
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described below. The assay for contaminating genomic DNA utilizes gene specific 
oligonucleotides that flank a 145 nucleotide long intron (intron #3) in the gene encoding Prostate 
Specific Antigen (PSA).This is a single copy gene with no pseudogenes. It is a member of the 
kallikrein gene family of serine proteases, but the oligonucleotides used in this assay are specific 
to PSA. The sequences of these oligonucleotides are: 

5'CGCCTCAGGCTGGGGCAGCATT 3% SEQ ID NO:6 
and 

5'ACAGTGGAAGAGTCTCATTCGAGAT 3\ SEQ ID NO:7. 

In the assay for contaminating genomic DNA, 500 ng to 1,0 ug of each of the DNasel 
treated RNAs are used^as templates in a standard PCR (35-40 cycles under conditions described 
bejow) in which the oligonucleotides described aboye are used as primers. Human genomic DNA 
is used as the appropriate positive control. This DNA may be purchased from a commercial 
vender, A positive signal in this assay is the amplification of a 242 nucleotide genomic DNA 
specific PCR product from the RNA sample being tested as visualized on an ethidium bromide 
15 stained electrophoretic gel. There should be no evidence of genomic DNA as indicated by this 
assay in the RNAs used in the RT-PCR assay described below. Evidence of contaminating 
genomic DNA results in re-digestion of the RNA with DNasel and revaluation of the DNase 
treated RNA by determining its OD 260mo ratio, examination on electrophoretic gel and re-testing 
for genomic DNA contamination using the described PGR assay. 

i The standard conditions used for PCR (as mentioned in the last paragraph) are: 
IX GIBCO-BRL PCR reaction buffer [20 mM Tris-Cl (pH 8.4), 50 mM KC1) ; . 

l;5 mMMgCl 2 .' , ■ . 

200 uM each of the ; four dNTPs , : o 

200 nM each oligonucleotide primer 
25 . concentration of template as appropriate 

V "nits of Taq polymerase per 1 OOfdof reaction volume. . ; i v 

Using these conditions, PCR is performed with 35-40 cycles of: « 
; ; ^4?Cfor45sec ']■'.■"' .[ /' •- "..'..7J-V, ■ 't; '"..-1 . ' 

: ; 55^60«>Cfor45 sec • .' ■ '-. ^C---'^--- .'T'/. - V7'v .-• 

30 72°C for 1:00 minute. ' v , ' .". . 



20 



Wd 98/24935 M2T/US97/22105 

81 

The protocols described in the above section -permit isolation of total cellular RNA that 
overcomes two of the six barrieris to successful RT-PCR, i.e. the RNA is acceptably intact and is 
fr£e from contaminating genomic DNA. 

Reverse trauiscriptase^, also called RNA dependent DNA polymerases, as applied in' 
5 currently used ffi known to be less processive than other 

^commonly used nucldc aicid jwlymerasesl It has been observed that not. only is the efficiency of 
'SoSVersioh of RNA to cDNA relatively inefficient, there isialso^ s^ in the 

efficiency of 6DNA synthesis between reactions that use RNAs as templates that otherwise 
appear indiistinguishablel The sources of this variation are hot well characterized, but empirically, 
10 it has been observed that the efficiencies of some reverse transcription (RT) reactions may be 
improved by repeated organic extra ethahol precipitations. This implies that some of 

thb vimati^ the DNasel 

^eMrtieht described aboVe may be aiding the RNA to an 

additional cycle of extraction with phenol and chloroform and ethahol precipitation. 
15 Cbiitanadriatioh of the template RNA with inhibitors of RT;is an important barrier to successful 
RT that is partially overcome by carefiil RNA prepa^tion and repeated organic extractions and 
ethanol pfecipitiationsv m ^ r ^ 

Reverse transenptibii reactions are performed uising the Superscript!" Preamplification 
System for First Strand C©KA Synthesis kit which i^ M 
20 ' ; \£if<^ Su^rsciipt^ , is ^ reverse u 

} ttaitise^^ 
• * - pi^e^ivity. lirtbeptefc^ 

£©NA^s>^ a 
mjxtoire of tahdOhi hexamers (or other small oligonacleoMdes of random sequence) and oligo dT. 
25' The'addliidft Ofbligo dT increases the efficiency, of conversion of RNA to c©l^A proximal to the 
■ y: ^--^p^A i ^rks tempiate^either <5 Or 10 mic*bgrams of ^A isvused (dependmg o» Avaaability). 
' ' l M -After 1he RT-Teaction'has been completed according; to; the. protocol provided bysGI-BCO?BRL, 
lHthe>kf Action fe^ Water to a final volume of. l QQ;jiL i&):&n NUfr. ^ U *>**-'•■.• 

Even with the best prepared RNA aiidfeffie to6^ ; ^ be 
30 significant variation in the efficiency of RT. This variation would be sufficiently jgreat that 
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cDNA made in different RTs could not be reliably compared. To overcome this possible 
variation, cDNA populations made from different RT reactions may be normalized to contain 
equal concentrations of amplifiable cDNA synthesized from mRNAs that are known not to vary 
between the physiological states being examined. In the present examples, cDNAs made from 
5 total cell RNAs are normalized to contain equal concentrations of amplifiable b-actin cDNA. 

One ul of each diluted RT reaction is subjected to PCR usin^oligonucleotides specific to 
p-actin as primers. These primers are designed to cross introns, permitting the differentiation of 
cDNA and genomic DNA. These 0-actin specific oligonucleotides have the sequences: 

5' CGAGCTGCCTGACGGCCAGGTCATC 3% SEQIDNO:8 
10 ' ■ • . an< L - • ' ..; ," 

5 ' GAAGCATTTGCGGTGGACG ATGGAG 3% SEQID NQ:9 
PCR is performed under standard conditions as described previously for either 19 Or 20 
cycles. The resulting PCR product is 415 nucleotides in length. The product is examined by PCR 
using agarose gel electrophoresis followed by staining with ethidium bromide. The amplified 
cDNA fragment is then visuahzed by irradiation with ultra violet light using a transiUuminator. 
A white light image of the illuminated gel is captured by an IS- 1000 Digital Imaging System 
manufactured by Alpha Innotech Corporation. The captured image is. analyzed using either 
version 2.0 or 2.01 of the software package supplied by the manufacturer to determine the 
relative amounts of amplified p-actin cDNA in each RT reaction. , 

To normalize the various cDNAs, water is added to the most concentrated cDNAs as 
determined by the assay described in the last paragrapfri BCR using 1 ul of the newly rediluted 
and adjusted cDNA is repeated using the p-actin ohgpnucleotides as primers. The number of 
cycles of PCR must be increased to 21 or 22 cycles in Order to compensate for the decreased 
concentration ^ 

adjusted by dilution to contain roughly equal concentrations of amplifiable cD^ Sometimes 
tms proce^ must be repeated to give acceptable final iK>rma}izationi ; By diyidmg the average 
d^ti^density of all observed bands by that of a particular band, a normalization stafistiemay be 
created that will permit more accurate comparisons of the relative abundances of RNAs 
examined in the normalized panel of cDNAs; v r 1 
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1 

Once the normalization stati^ics are derived- PGR may be jxrfbrmed lising different gene 
specific oligonucleotides as primers to determihe the relative abundances of other mRN As as 
repfes&ited as cDNAs in the normalized p&nel of diluted RT reaction prodtietis. The relative 
intensities of the bands is then adjusted and normalised 16 p-actih expf essibri by multiplying the 
5 intensity quantities by ^ the noimalization statistics derived. 

; ' ' " In the next section an RT-PCR assay is discussed that tises pooled cDNAs arid is more 

likely to capture data from PCRs while in the linear portions of their amplification curves. The 
eifor caused by observing PCRs after the linear portion of PGR is in the direction of 
quantitatively underestimating iriRNA abundance differences. To determine quantitative 
10 differences? in mRNA expressibn, it is necessary that the data is collected in the linear portion of 
the respective PGR amplification curves. This requirement is met in the assay described in 
following paragraphs. * : ' : 

The i^ twb barriers to 

use of pooled cDNAs as templates in RT-PGR. In practice, the protocols using pooled templates 
15 are usually perfoim described above: 

There are two additional baririere to relative riiRN A quantitation with RT-PCR that 
frequently compromise interpretations of results bbtaihed by this method* The first of these 
involves the need to quantify ^ the PGR is still in the linear 

v jk>rtioh of the process where "E" behaves as a constant and is nearly equal to two. In the ^linear" 
20 portion of the amplification curve, the log of the mass of the amplified product m directly 
^ the end of 4he< £GR' ptocess, "E" is not constant. Late in 

^Cl^'^^eclinfes with each additional eycle until there fe^Sndfease in PCfe' product ittass with 
/: ^^^n^^ybles: • - • • :: - ' ^< >'->^? ' ^ - - -7- ^ ' ■ 1 V ■ J ' 5 - - 
" ^ TThe nibst iirij^ttmit ^ edSon Stfhy the 5 efficidtfc^ df feplific^tion decreases high PCR 

20- m fcyde humber, may be that the t^6a^ation of the PCRpbduets bectMies hi^ enough that the: 
^- Ifrmds St the product begin ttf anffeal t&^M oth^ '^fo^ at 

' U ^ M ? the in^ TMi eompetition 

' iA " : "^b^^n "the PGR product stiifods^and ^ primed cfeates a^ decrease ih PCR 

^ ^lf^ieric# the A^eiicy of ia^iiiBcaticai is tdecafeased is called the 

30 "plateau" phase of the ^plii^^ as a constant and the 
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PGR begins to move towards the plateau phase, the conservation of relative proportionality of 
amplified .products during PCR is lost. This creates an error in estimating the differences in 
relative abundance of an mRNA species occurring in different total cell RNA populations. This 
error is always in the same direction, in that it causes differences in relative mRNA abundances 
5 to appear less than they actually are. In the extreme case, where all PCRs have entered the, 
plateau phase, this effect will cause differentially expressed mRNAs to appear as if they are not 
differentially expressed at all. \ 
Tocontrpl for this type of error, it is important mat the PCR products quantified in the 
linear portion of the amplification curve. This, is technically difficult because currently used 
10 means of DNA quantitation are only sensitive enough to quantify the PCR products when they 
are approaching concentrations at which the product strands begin to compete with the primers 
for annealing. This means that the PCR products may only be detected at the very end of the 
linear range of the amplification curve. Predicting in advance at what cycle number the PCR 
products should be quantified is technically difficult, 
15 Practically speaking, it is necessary to sample the PCR products at a variety, of cycle 

numbers that are believed to span the optimum detection range in which the products are 
abundant enough to detect, but still in the linear range of the amplification curve. It is impractical 
to do this in a study that involves large numbers of samples because the number of different PCR 
reactions and/or number of different electrophoretic gel&that must be run becomes, prohibitively 
20 large. 

; To overcome these limitations, a two tiered approach, was designed to.relatively quantify 
. mRNA ahundance levels .using RT-PCR. In ^.^i^-dgflfa: of cDl^As ; produced by 
combining equal amounts of normalized cDNA are examined to deterntine how mRNA 
v . abundances vary in the average individual with a particular phy^io^gical stete. This reduces the 
25 ^ number of compared samples to a very small number such as two to four. In the studies v 
described herem, : two pools are 0 ^ pools of . normal individuals and those 

indivaduals with metastatic prost#eicancer. Ea^^ [ 
While this approach does not discrintinato different it may easily discern 

broad patterns of differential expressions ^ great advantage ofexaXmn^^ is that 

30 it permits many duplicate PCR reactions to be simultaneously set up. 
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The individual duplicates may harvested and examined at different cycle numbers of 
PCR. In studies described below, four duplicate PCR reactions were set up. One duplicate was 
collected at 31, 34, 37. and 40 PCR cycles. Occ^ionally, PGR reactions were also collected at 28 
cycles. Exaniining die PC Rs at di ffcrcnt cycle numbers yielded the following benefits. It is very 
5 ' likely that at least one ol the RT-PCRs will be in the optimum portion of the ariiplification curves 
to reliably compare relative niRNA abundances. Iii addition, the optimum cycle number will be 
known, so that studies with much larger sarilple sizes are much more likely to succeed* This is 
ttife second tier of a two tiered approach that ha& been taken to relatively quantify mRNA 
abundance levels using RT-PCR. Doing tKe RT-PCR \vith the pooled samples permits much 
10 more efficient application of RT^PCR to the samples derived frbm individuals. A further benefit, 
also as discussed bfeiW/ ttibe to tube variability in PCR may be discounted and controlled 
because most skidies yield multiple data points due to duplication. 

Like the previously described protocol involving individuals, the first step in this protocol 
is to normalize the pooled samples to contain equal amounts of amplifiable cDNA. This is done 
15 using Oligonucleotides ihat 'dfr^ In this example, a PCR 

amplification of a cDNA Fragment derived from the 0-actin mRNA froto pools of normal 
individuals and individuals vHth ihfetastatic prostate cancer wias performed. This study was set up . . 
as four identical PCR reactions The productS-of ^ese PCRs were collected and electrophoresed 
v after ^ 

20 x that thfe JPCRs were still in the linear ranges of their amplification curves at 22, 25 and 28 cycles 
but that they 1^ linearity at 3 1 cycles. This is knovvii because the ratios oftheband Intensities 
remain constant and internklly t^i^ste& folr &&i&fa r obtained #oii* 22, C25 *£rid 28 Cycles, but 
r ' iheSe ratios became distorted at 31 cycles. This Quantitation will also permit the derivation of 
ndrihali^ng statistics fot the threb j^Is ita^SVe to each other in exactly the same ftiaiMer^as was 
25 ^ r 

^ 1 Study then repeated usiri^g^sp^ific priiners for a gene other ?tHaif p-actin. The 

J *inteh^ bands were q^^ted using the IS 1000 a^d ri6rmaliz«l the p-actin 

•^^signais; : -' ;] ":. ■^■r-mi <'%=,f-%: <i?.-.',-- l .;; r r i\ ^iV • ; V ^<fu.,^.: - ^ 

f k - The teritral qUestion to be answered in analysing this data is whether the PtpRs have been 
30 examined iri the linear portions of their amplification curves: A test for this may Be devised by 
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determining if the proportionality of the PCR products has been conserved as PCR cycle number 
has increased. If the ratio between the two pools of a given PCR product remains constant with 
increasing cycle number, this is strong evidence that the PCRs were in the linear portions of their 
amplification curves when these observations were made. (This is better conservation of 
proportionality than is frequently observed. In some studies, data was excepted when the ratios 
were similar but not identical.) This conservation of proportionality was lost at 40 cycles. This 
indicates that these PCRs are nearing the plateau phases of their amplification curves. 

The final major barrier to quantifying relative mRNA abundances with RT-PCR is tube 
to tube variability in PCR. This may result from many factors, including unequal heating and 
cooling in the thermocycler, imperfections in the PCR tubes and Operator error. To control for 
this source of variation, the Cole-Parmer digital thermocouple Model # 8402 00 was used to 
calibrate the thermocyclers used in these studies. Only slight variations in temperature were 
observed. . 

To rigorously demonstrate that PCR tube to tube variability was not a factor in the studies 
15 described above, 24 duplicate PCRs for p-actin using the same cDNA as template were 
performed. These PCR tubes were scattered over the surface of a 96 well thermocycler, including 
the corners of the block where it might be suspected the temperature might deviate from other 
areas. Tubes were collected at various cycle numbers. Nine tubes were collected at 21 cycles. 
Nine tubes were collected at 24 cycles, and six tubes were collected at 27 cycles. Quantitation of 
the intensities of the resulting bands with the IS 1000 system deteimined that the standard error 
of the mean of the PCR product abundances was ±13%. This is an acceptably small number to be 
" discounted; as am^ 

The RT-PCR protocol exanumng pooled cDNAs is internally cpntrplled for tube to tube 
variabUity that might arise from any source. By examining the abundance of the PCR products at 
25 several different cycle numbers, it may be determined that the mass of the expected PCR product, 
is mcreasing appropriately with increasing PCR cycle number. Not only does this demonstrate 
that the PCRs are being examined in the Imeari phase 

it demonstrates that each reaction with the same template is consistent with the data from the 
surrounding cycle numbers.. If there was an unexplained source of variation, the expectation that 
PCR product mass would increase appropriately vrim increasing cycle number would not be met ; 
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This would indicate artifactual variation in results. Internal duplication and consistency of the 
data derived from different cycle numbers controls for system derived variation in tube to tube 
results. 

As described in the preceding paragraphs, the RT-PCR protocol ; using pooled cDNA 

5 templates overcomes the last two barriers to effective relative quantitative RT-PCR. These 

barriers are the need examine the PCR products while the reactions are in the linear portions of 

their amplification curves ^nd the need to control tube to tube variation in PCR. The described 
" . ■ . ... 

A protocol examines PCR products at three to four different cycle numbers. This insures that the 

f PCRs are quantitiated in their linear ranges and, as discussed in the last paragraph, controls for 

1 6 : r vp^&^bfe tube to tube variation. j ■ ■ 

v ^ One • fihal question is whether p-actiri is an appropriate; internal standard for niRNA 

quantitation, p-actin has been used by many investigators to normalize mRNA levels. Others 
^ itself differentially regulated and therefore unsuitable as an internal 

hontializatibn standard; In the protocols described herein differential regulation of P-actin is not > 

15 si cbnce5rn. More than fifty genes have been examiried for differential expression Rising these 
protocols. Fewer thab half Were actually differentially expressed. The other half were regulated 
similaWy to p-actiri within the standard error of 13%. Either all of these genes are coordinate^ 
differentially regulated with P-actin, or none of them are differentially regulated. The ppssibility 
that all of these genes could be similarly and eoprdinately differentially regitfat^d with p-actin 

20 seems highly unlikely. This possibility has been discounted, V , v 

t% * , p4abtih has also been criticized by some as an internal standard in PCRs because of the 
iiarge iiumber of pseudogeries of p^acthi that* occur in mammalian genomes. This is ; not a 
-y --^^i&Mxfo iii the described assays because all of the RNAs used herein aye demonstrated: to be 
free of contaminating genomic DNA by a very sensitive 

25 number of PCR needed to detect p-actin cDNA from the diluted RT reactions, usually between 
19 and 22 cycles, is sufficiently low to discount any contribution that genomic DNA might make 
to the abundance of amplifiable pectin tefnpl^e$v^^ ^ •;• ^ f : < ; ^ : 
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Example 2: 

Identification oj 'Markers of Metastatic Prostate Cancer by Use of 
RNA fingerprint ingby PC R primed with oligonucleotides of arbitrary sequence. 

RNA fingerprinting by I»CR, primed with oligonucleotides of arbitrary sequence^ was 
performed on RNAs isolated from peripheral human blood. Bands which appeared to be 
differentially expressed Were cloned. 

For this study, total cell RNA was isolated from buffy coat cells as described above. cDNA 
was made from One to five U g of each isolated RNA. All cDNAs were normalized for similar 
amountsof 6-actincDNA by RT-PCR, RT-PCRpn>ducts were electrophorescd through aga^ 

For relative quantitative RT-PCR with an external standard, quantitation of band intensities 
on ethidium bromide stained gels was performed using the IS-1000 image analysis system 
manufactured by the Alpha Innotech Corp. A normalizing statistic was generated for'each cDNA 
sample, as the average of all B-actin signals divided by the B-actin signal for each cDNA sample 
respectively. Data for each sample was then normalized by multiplying the observed densitometry 
observation by the individual normalizing statistics. Normalized values predict differences in the 
steady state abundances of the respective mRNAs in the original total cell RNA samples. 

The nucleotide sequences of all cloned PCR products were determined by dideoxy 
termination sequencing using' either the ABI of Pharmacia automated sequencers. '.'.*. ,,-;}.. 

This^rofocol resulted in the discovery of an mRNA species that was 2-1 fold less abundant 
in the peripheral blood of metastatic prostate cancer patients than in the peripheral blood of normal 
mdividuals of both sexes. The sequence of this band, referred to as UCBP Band #35 (SEQ ID 
NO:l), matches ah EST derived from afetal brain cDNA library {GenBank Accession #T03013). 
Down regulation of this band in the peripheral blood of metastatic prostate cancer patients was 
confinried by relative quM ./. 



Idei ^e<rt°n<rf Markets^ 
Use ofRNAfingerprintingbythe Pairwise Sequential Method. 

RNA fmgerprintingwas used to identify differentially expressed RNA species according to 
the pairwise sequential method of McClelland et at (1$>94), as modified to use larger (17-25 mer) 
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arbitrary oligonucleotides. PCR amplification products were labeled using cc-32P-dCTP and Were 
■Visualized by autoradiography after electrophoresis on denatiiringpolyacrylamide gels. A number 
of bands appeared to be differentially expressed, and were cloned as described above. 
UC B^d W321 was confirmed by 
5 prostate cancer patients, with a four-fold decrease observed compared with normal individuals. 
The DNA sequence of Band #321 does not match any known sequences ill the GenBank database. 
It therefore represents a previously undescribed gerie product. 

OC Band #302 and UC Band #325 were both observed toffee up regulated in the peripheral 
blood of rhetastatic prostate caiicer patients. UC Band #302 is identical in sequence to a portion of 
10 the sequence of elongation factor 1-a (GenBank Accessioh #X03558). This band was modestly 
iiifreased beiweeri 1 .6 arid 2-fold in metastatic cancer patients compared with normal individuals. 

ticiB^d #325 \^ found to consist of two different alternatively spliced forms of mRNA, 
encoded by the interleukin-8 (IL-8) gene. UC Band #325-1, the previously identified mRNA 
species of IL-8 (Genbank Accession #Y00787), is approxiiiiately seven-fold more abundant in the 
15 peripheral blbod of metastatic prostate cancer patients. The alternatively spliced IL-8 mRNA, 
containing intfori #3 of the IL-8 gene (Genbank Acbession #M28130) is up to seven-fold less 
abundant iii the peripheral blood of metastatic prostate cancer patients. Fig. 1 A shows relative 
qu^titative ^T-PCR of the differential egression of IL-8 (-UC235) in f>eripherai blood of 
patients with met prostate cancer (M) an& hotm^^ (N) at different PCR cycles 

20 (cy). The two alternatively spliced forms of the IL-8 mRNA are observed, The upper band (int.+) 
includes intron 3 in the mature mRNA. ^Thfe iower temd (int.-) lacks intron 3. Fig. IB shows 
relative quantit^^fel^ 

of patients with metastatic prostate cancer in lanes 1 -5 and a pool of normal individuals (N). The 
alternatively spliced fohiis of the IL-8 mRNA observed are differentbetween normal individuals 
25 and tfc6$e with prostate cancer. Overall, there fe an approximately 30-fold change in the ratios of 
thfe tWd spliced forms of II>8 mRNA in individib^s with metastatic prostate catice^r compared with 
nom^H 

^ As described abovfe, an increased expreisSibnof IL-8 mRN^^ b^ii^vim^fy 
cafic&r p>aHkrts. However, this represents the fitst finding of W^tisafnMvely of EL-8 

30 mRNi^ containing intron 3, that is significantly more abundant in iiorrdal ihdi^duals compared 
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with metastatic prostate cancer patients. These results are surprising in view of previous reports 
which had failed to find any alternatively spliced forms ofIL-8 mRNA in normal individuals or 
cancer patients. 

It will be recognized that the genes and gene products (RNAs and proteins) for the above 
described markers of metastatic prostate cancer are included within the scope of the disclosure 
herein described, It will also be recognized that the diagnosis and prognosis of metastatic prostatic 
cancer by detection of the nucleic acid products of these genes are included within the scope of the 
present invention. Serological and other assays to detect these mRNA species or their translation 
products arc also indicated, h is obvious that these assays are of utility m^agnosing metastatic 
cancers derived from prostate, and other tissues. 

Most .signi ficantly, these Examples demonstrate the feasibility of using RNA fingerprinting 
toJdentify mRNA species that are differentially expressed in the peripheral bipod of patients with 
asymptomatic diseases or in patients with symptoms that are insufficient for a definitive 
diagnosis. It will be appreciated that mis technique is applicable not only to the detection and 
diagnosis of prostate and other cancers, but also to any other disease states which produce 
significant effects on lymphocyte gene expression. Uses which are contemplated within the scope 
of the present disclosure include the detection and diagnosis of clinically significant diseases that 
retires medical intei^ention, including but not limited to asthma, lupus erytlii^matosis, 
rheumatoid arthritis, multiple Sclerosis, myasthenia gravis, autoimmune thyroiditis, ALS, interstitial 
cystitis and prostatitis. 

)■ . i TABLE 2 ~' ' \7;_.. , 

Genes.. 

; 'WiX-i Metastatic Prostate Cancer Rektrve to ^ 



Name of - 
cDN A Fragment 


Sequence 
- Determined, 


Confirmed 
by RT-PCR 


Previously . 
Known 


UCPB35 ■ 




Yes,.. ^ G 


B#T03O13 


VG 302SEQIDNO:3 






EFl-a : 


Ue321 S|QIDNO:2 




Yes 


No 


UC 325-1 SEQ ID NO;4 


. r V- ;t; Yes ,; 




B#Y0O787 


UC 325-2 SEQ ID NQ:S, . 




. Yes.,*,';, . . ■ . 


II>8 



nworv-vrirv ^wr% flP9«nui i ^ 
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TABLE 3. 

Oligonucleotides used in the felativie quantitative RT-PCR portion of these studies. 
Oligonucleotidesused to examine the expression of genes: 



" 5' TG^AAACtTTCACC^ 

5' CTTGTGACTTGCTTTGATAGAATG3', SEQ ID NO: 1 1 
10 UCBand#302(elongationfactor 1-a). 

1 5' GACAACATGCTGGAGCCA AGTGCy, SEQ ID NO: 1 2 

- v S'ACGACCAATirrGTAAGAACATCGrnVSEQIDNO^S 
UC Band #321 (previously uncharacterized gene). 

5* TGTCCAGAGATCCAAGTGCAGAAGG3', SEQ ID NO: 14 
15 5' GAGCTCCAGGAGACAGAAGCCATAG3*, SEQ ID NO: 15 

UC Band #325-1 (IL-8). 

,. 5*gg ; : 

j 5 • TGGCAACCCTACAAGAGACC3',>SEQ ID ~HQi\l 

UC Band #325-2 (IL-8). 
20 5' GGGCCCCAAGGAAAACT3', SEQ ID NO: 18 

5^TGGCAACCCrACA^CAGAeC3VSE0lbNb:19 '-' 



25 v 1 



IGTGG^GA^GAG3V^Q ID NO^ k; 
- Asp^ra^e SyntHetase'(AS) ^ * r - ; * * — • 

. Hf^k , > f S' ACATTGAAGCACTCCGGGACjSVSEQ KXNOs20 ,. 5 /; .; mf ^ ; 
5* AGAGTGGCAGCAACCAAGCT3V SEQ ID NO:21 
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Example 4: 

DNA Sequences of Markers of Metastatic Prostate Cancer 

The DNA sequences of the markers of metastatic prostate cancer were determined by 
Sanger di deoxy sequencing as detailed above. The identified sequences are provided in Table 4. 

TABLE 4. 

UNA Sequences of Markers of Metastatic Prostate Cancer: 
10 UCPB Band #35 (SEQ ID NO:l) Matches a fetal brain EST, GenBank Accession # T0301 3 

15 A AGC A A GTC AC AAGCCCCTGCCA ATCA 3, A1 1 A1 A1CAOTCTGTTGCATT CTATCA 
UC Band # 321 (SEQ ID NO:2) previously uncharacterized Gene 



25 



30 



20 T^rrr^^ 

cct^ga^ct^ 



UCBand # 302 (SEQ1D NO:3) Hwan ^ 
ii GG ^l GGTGACAACATC ^ 



r.ArrrPAT^ ™ A r - ~ . GGAACCACGCTGCTTGAGGCTCTG, 



35 AA^rAx^^C^ 1 1UU1UU1AI IWIACTGTTGCCTGTTTGGCCGAATTGGAA 
35 AACTGGTGTTCCTCCAAACCCCGGTTATGGTGGGTTTCCTCCTCCTTGGA 3' 

UC Band #325-1 (SEQ ID NG^XHurt^ #Y00787 
GAGGAGTGGACCAACACTGGCGCCAAACACAGAAATTATTGTAAAGCTTTCTGAT G 
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gAagagagctctfgtctgggccgcaag 

tttttgaagagggctgagaattcataaaaaaattcattgtgtgtggtatccaagaat \ 
cagtgaagatgccagtgaaacttcaagcaaatctacttc^ 
tgggtctgttgtagggttgccagttgtt 3' 

UC Band #325-2 (SEQ ID NO:5) Human IL-8 mRNA Containing Intron #3, 

5*gcttg&gccccaag 

10 GTAAGTTATATATTTTTGAATTTAAA 

AAGTCAGCCTATAAATTTCTTTCTGTTC 

TGGTTAAAAAAAA^GGAATAGCATCAATAGTGAGTGTGTTGTAGTCATGACCAGA 

aagaccatacatagtttgcccaggaaattctgggtttaagcttgtgtcctatactctt 
agtaaagttgtttgtcactcccagtagtgtcctatgttagatgataatgtctttgatc 
i5 tccctalyt atagtt^g ag aatat^ag agcatgtct ^cacatgaatgtcaaagactat 
attgacttttgaagaaggctagtttgcttgttattaaagataggtgatgtita^^ 
gaattttattttaggggtgagaattcataaaaaaattgatx 
aatcagtgaaGatgccagTGaaacttc 
gtgtgggtctgttgtagggttgcca 3' 

20 



Example 5: 

Detection arid Differential 

• Advanced Stage Prostate Carcinomas Using 
25 Comhindtipm of IL-8 with Other Prostate Disease Markers. 

A total of 164 serum specimens from normal 1 men or men with a Biopsy confirmed 
diagnosis of BPH or prostate cancer were studied. These serum specimens wereprovided by. Dir. 
George Wright from the Virginia Prostate Center at the Eastern Virginia Medical School and by 
30 Dr. Robert Vessella from the University of Washington 5 or were normal donors from UroCor, 
Inc. Ml patients were biopsy-confirmed for eimer BPH'^r prostoto carcmoma (sta and 
C only) within six months after PSA serum collection and/or a DRE-positive diagnosis. All, 
■v0 ^patient sera were obtained prior to any surgical or hormonal therapies; The mean age of the total 
v sample was 69.4 ± 8,6 years (range = 37 - 9Lyears),old. r ; , ^ v 

35 -m- ' ' The subset of patients utilized for mulfivariate diagnostip. serum m of 13 

BPH and 64 CaP (Stages A^B^C) cases from theparent population (Marley et al, 1996). All 
< patients in the subset had a^total PSA between 2.0 - 20.0 ng/ml^ which is a,standard,rapge for fft 
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PSA testing (Marley et al. , 1996). Also evaluated were a subset of Stage D CaP patients, with t- 
PSA values ranging from 6.5 - 867 ng/ml. 



D Nn^f ' * Mean Age ±Std. Dev. (Range) 

orma ° < 50 years 

66.4+ 8.6 (37 - 87) years 
74.7±7.8 (61 - 91) years 

CaP Stage C 14 68.9±6.9 (60 - 80) years 

CaPStageB 14 72.3±8.6 (58 - 861 W 



Normal 8 

BPH 55 

CaP Stage A 24 

CaP Stage B 48 



table 5 shows the distribution of the total PSA levels, the f/t PSA ratios, and the UC325 
levels for the 164 patients, broken down by normals, BPH, and Stages A, B, C, & D prostate 
cancer. Only the BPH, Stage A, Stage B, and Stage C prostate cancer patients were included in 
the statistical analysis. • 

TABLE 5 

UC325 Patient Sample Characteristics (n = 164) 



Mean Value ± Std* Dev. 



UC325 Total PSA f/t PSA 

Diagnosts N (pg/ml) <ng/ml) Ratio (%) 

Normal 8 0.2 + 0.6 N/A ti/A 



r , p ^ . * 5 A 6.8 + 6.1 6<9±4>© 21.9 ± 10.9% 

cSsl-l W 19.1 ±10.4 14.6 ±10.5% 

-S-SSS W 13.5 + 9.5 8;8±6:6 11.9 ±5.7% 

Sift n f 19.1±7.9 16:2*76 11^±8;3 v 1 

CaPState D 14 78.9 ±197 244 ±332 12.4 ± 7.1% ^ " 

Table 6 illustrates die ability for f/t PSA ratio at thi^ different cWoffs to ^ 
prostate cancer and BPH in the inventors' patient sample. UC325 aL-8) and t-PSA ^ analyzed 
af smgle Glassiflca^ Regression Tn^ (C>^T) cntbff points for the Sanie outcome. Note- 
tte si ^ficafat iMpr6vem^ by the UC325 (IL-8) 

scfum assay%o detect clinically organ confmed. The combination of UC325 ai>8), fieated as a 
continuous variable, and t-PSA or fit PSA ratio provides a highly predictive multivariate test 
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system to diagnose CaP (clinical stages A & B) withoiit any interference provided by BPH in the 
inventors' patient subset. / if 

TABLE 6 

5 Ability of Senim Testts to Discriminate BPH and CaP. 



. Serum Test 


Cutoff 


Sensitivity 


.Specificity 




p-value 


.p #t PSA Ratio 


n% 


52.9% 


91i9% 


. 0.7905 


< 0.0001 


tin 


14% 


70.1% 


80.0 






»ltt . ' 


20% 


85.1 






' lit/ 


UC325 


9.8pg/ml 


•'\ : 72.4% 


74:5% 


0-7973 ' 


<p.oooi 


Total PSA 


14.8 ng/ml 


17.2% 


98-2% 


0.5995; 


0.0134 


f/t PSA &;UC325 


1 0.69** 


71.3% 


90.9% 


,6.8784 


<0.0001 


Total PSA&UC325 


,0.64** 


62.1% 


, . 85.5% 


0.8069 


<0:000l 



*A11 cutoffs determined using Classification and Regression Tree Analysis (CART) 



♦^Predicated Probability value calculated tising jog^ function 

% To further isubstantiate th^ results of Table 6; individual analysis using Receiver Operator 

10 Characteristic (ROC) curves are provided for each variable. Figure 2 ill^ti^tes the ability of t- 
PSA to distinguish BPH and Stages A, B, and C prostate cancer. Figure 3 shows the ability of fit 
PSA ratio to distinguish BPH and S^igfe^^Ai B, and C prostate cancer. Figure 4 shovvs the,ability 
of l|J£p^(^ prostate cancer. Figure 5 

sbp^s ^e: ability of the combination of UC325 (IL-8) and total PSA (t-PSA) to distinguish BPH 
15 and Stages A, B arid C prostate cahcejr. iFigtire % stiovvS the ability of the q^mb^tion of UC325 N| 
0l^%id *he &t PS'A ^p to distiiiguish betweeti and stages A, B and^^rostate cancen 
It is apparent that Me eombinatioii xxf UC325 measurement with either t-P&A or fJT PSA 
prd^W £^ in sensitivity of detSstidh, while ^maintainm^ a high degree of 

spq^if&a^r> Thus, the combination o^pC325 (IL-8) wi^h other prostate dise^e markers, such as 
20 t-PSA or Ft PSA ratio, provides a si^ificaht advice in tHe detect 
of prostate cancer. 
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Table 7 presents the correlation values for the different serum markers. This table clearly 
shows that the UC325 biomarker provides information which is independent of that provided by 



the f/t PSA ratio. 



TABLE 7 

Correlation Values for BPH vs Stages A, B & C (n = 142) 





Diagnosis 


TdtalPSA 
(ng/ml) . 


fftPSA 
Ratio (%) 


UG325 
(pg/jnl) 


.Age-,. 


Clinical 
Stage 


Diagnosis 
Total PSA 


1.0000 


0.5647 


-0.1912 


0.2262 


0.1590 


0.3497 


(ng/ml) 
f/t PSA 


0.5647 1 


1.000 


-0.2319 


0.5991 


0.0898 


0.3729 


Ratio (%) 
UC325 


-0.1912 


0.2319 


1.0000 


-0.2142 


0.0641 


-0.4126 


(pg/ml) 

Age - 
Clinical 


0.2262 
0.1590 


0.5991 
00898 


0.2142 
0.0641 


i.oooo 

0.0881 


0.0881 
1 0000 


0.2486 
0.1372 


Stage 


0.3497 


0.3729 


-0.4126 


0.2486 


0.1372 


1.0000 



Table 8 clearly; demonstrates a relationship between tumor burden and serum UC-325 
gene product measured by IL-8 assay. Note that as biopsy-confirmed clinical stage of the cancer 
increases, so does mte IL-8 senim marker concentration, whereas the sahie relationship did not 
occur with [t-PSA] or f/t PSA ratio. . ^ ; 
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Example 6: 

Identification o) r Markers \oj 'Metastatic Prostate and Breast Cancer by Use of 
RNA fingerprinting by PCR primed with oligonucleotides of arbitrary sequence* 

5 : RNA fingerprinting displays PGR™ amplified cDNA fragments that represent a sample 

of RNA species derived from a population of total cell RNAs. When displayed side by side, 
... - comparisons of similarly produced fingerprints representing RNA populations from- cells of 
? v differing physiologic states identifies mRNA species whose relative abundances vary between 
the examined physiologic states/ In this study, RNA fingerprinting identified two cDNA 
10 i fragments derived from mRNA species that had higher steady state abundances in the peripheral 
blood leukocytes of patients with recurrent metastatic prostate; cancer as compared to a group of 
healthy volunteers. 

Eight ml of peripheral blood was collected from healthy volunteers, patients with 
clinically and biopsy confirmed BPHj localized and advanced metastatic prostate cancer, and 
15 from patients with advanced metastatic breast cancer. Metastatic prostate and breast cancer 
patients that had failed a primary therapy and had evidence of recurrence of disease were 
. selected. The metastatic prostate cancer patient? had high (> 50 ng/ml) serum concentrations of 
■sPSA;; Circulating nucleated peripheral > blood cells were separated from erythrocytes by 
centrifugation in Vacutaine* CPT 1 ^ tubes (Becton Dickinson and Company, Franklin Lakes, N 
20 , r J). ; Total. RNA was prepared from isolated nucleated peripheral blood cells by lysis with RNA 
v ; : :; Stat^60™^elrTes V Inc., Friendsvsrood, T2£), followmg the instructions provided by %<e vendor. 

*M Gpntaminating; genomic DN A Was removed from the total RNAs by digestion with RNase free 
; t i DNasel (GIB£0-BRL, Gaithersburg, MD>), For the PCR 1 ^ based applications of RNA 
; fingerprinting and relative qiiantitative RT^tPCR^ it is absolutely critical that the/total RNA is 
25, . completely free of genomic DNA. Typically, 5,0 to 10.0 pg of r total RNA was digested with 
::^,^-.O0N4<QNuiiits of RNase free BNasei'in40Qr^.|d. of reaction Volume fop- 20;i»inat:37?C. 
^ : ;r . yp fi^ Following digestion, tlie tptal RNAs were extracted w^^ Inc.* 
Solp^, OH) and ethanol precipitated. To confirm that the RNA .was free of contaminating 
genomic DNA, 500 ng to 1 .0 jig of each Dl^asel frf^&i JRN^V , was These 
30 were used as templates for PCR™ using oligonucleotide primers that anneal to exons 3 and 4 of 
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the gene encoding PSA (exon 3: 5' GCCTCAGGCTGGGGCAGCATT 3' SEQ ID NO-2? exon 
4: 5' GGTCACCTTCTGAGGGTGAACTTGC 3' SEQ ID NO:23). These primers anneal to 
opposite strands of genomic DNA that flank the 145 bp intron 3 of the PSA gene. PCR™ was 
performed at 94°C for 1:1 5 min. followed by 40 cycles of 94°C for 45 sec, 55°C for 45 sec, and 
72°C for 1:15 min, then a final extension of 72°C for 5:00 min; RNA Was considered DNA-free 
if no PCR™ products could be visualized upon gel electrophoresis that co-migrated with the PSA 
gene positive control of known human genomic DNA. If PSA gene products were observed after 
PCR™, the RNA was redigested with DNasel and analyzed again for contaminating genomic 
DNA. After it was confirmed that the RNAs were free of genomic DNA, 500 hg to 1.0 ug of 
RNA was electrophorescd on a 1.2% agarose Tris Acetate EDTA (TAB) to visualize the 
ribosomal RNAs (Fridell et at., 1995). Only RNA preparations for which the 28S ribosomal 
RNA could be visualized were selected for further analysis by RNA fingerprinting and relative 
quantitative RT-PCR™. 

RNA fingerprinting with arbitrarily chosen oligonucleotide primers (Welsh et at, 1992) 
is conceptually similar to differential display (Liang and Pardee, 1992), except that 
oligonucleotides of arbitrary sequence are used to prime both strands of cDNA synthesis instead 
of just second strand synthesis, as in differential display. In this investigation, the strategy of 
*NA fingerprinting used was similar to that described in Ralph e/ a/. (1993) except that 
oligonucleotide primers used were composed of two discrete domains. The 5' domain of these 
oligonucleotides consisted of ten nucleotides that complemented sequences from either the T7 
promoter o*the Ml3 reverse sequencing primer. The 3' domains of these oligonucleotides were 
8-mer sequences predicted to anneal frequently to the prbteii^coding regions of mRNAs in a 
I^ermiscuous fashion (Lopez-Nieto and Nigairi, 1996). These ^ ohgonucle^tides were men used in 
a sequential pairwise strategy that optmiizes tte athount of mRNA complexity mat can be 
surveyed with limited numbers of primers and starting RNA. Care was taken to ensure that the 
two oligohucleotfdes used to produce any single fingerprint did hot share sequence similarity in 
either their 5' ^ Because these oligonucieotides were constructed of short sequence 

ddniains that have specific fuhefions within this experimental design, the oligonucleotides are 
perhuscuous rather ^ arbitrary m nature. -V. 
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Two RNA pools were fihgeiprinted. These two pools were each created by Combining 
/equal amounts of peripheral blood total RNA from five individuals. One pool was constructed 
by pooling RNA from five healthy individuals while the other pool was derived from five 
individuals with recurring metastatic prostate cancer. Using the pooled RNAs as templates, first 
5 strand cDN A synthesis was primed by annealing One of the permiscuous oligonucleotide primets; 
uto the pooled ; RN As at low stringency. All fingerprinting studies were performed in duplicate 
using diricrcril initial concentrations of template RNA. The replicate fingerprints were initiated 
by usine cither 125 ng or 250 ng of RNA as template during first strand cDNA synthesis. 
^Reaction conditions for first strand cDN A synthesis were 250 units of Superscript II™ 
10 ^iiCO -fJRll'Gailhcrisburg, 1 MD) in IX supplier 1 S reaction buffer (25 -ii^^Tris-HCl'(pH=8.30/. 
37.5 mM kCl; tt) niM MgCl 2 ), 10 mM DTT; ^00 yiYA ea&i dNTP, and 2.0 permiscuous 
oligonucleotide in a 40 yi\ volume. The latter was incubated for 1 h at 37°C. Following first 
strand cDN A synthesis, the RNA Was digested with RNase H and heat inactivated at 70°C as 
directed by the supplier. 

15 One- tenth (4:0 jxl) of the first strarifl cDNA reaction niixture was used in the 

fingerprinting PGR™ reaction. As many as ten different RNA fingerprints were generated from 
e&cfilfirst strand cDNA reaction. To the first strand cl3NA, 36 |*1 of a PCR™ iriik isolation was 
added. The latter corVtaixfed SOmMTins^ KGU 200 yxM each dNTP, LO/jxCi 

of a 33 P-dCTP, 2.0 |aM second permiscuous oligonucleotide and 1:0 unit of recombinant Taq, 

20 iONft f>6iyfherase (G I BCO-BRL, Gaithersbur£; MD). Note that the concentration of the first 
' dligonucleotide is now slightly less that 200 nM. PGR™ fingerprinting waS performed With one 

1 " ^ cycles oi^94°C for 45sec, 4& 0 C fo* 1:30 min, and 72°C for 2:00 inin. A final extension step of 
• lf¥C fo# 5 00 was performed. Next, 4 0 (it bf the final PCR^ products were mi^ea with 6.0 pi. 
2§ 6f sequencing formamide dye solution and "denatured by heating to 75°G for 5:00 min. 
A^roxiiiiatdy 2.5 jxl of tfe' denatured PCR 1 ^ ^foducts in formamide dye was eltectfcojphoresed 
• ^ ^^tfii^ gei: ?CR^ ^ 

by iatitoradidi^phy: J ^ " •• 

' v The^ two differentially ap^Sring PCSt™* ^plified cI)NA fragments identified in these 
30 studies that are the subjects of this report were termed UC331 and UC332. UC331 was < 
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identified in a study in which the first; permiscuous primer used in the reverse transcription 
reaction had the sequence 5' ACGACTCACTATAAGCAGGA 3' (SEQ ID NO:24). The second 
permiscuous primer that was used in the PCR™ fingerprinting reaction that identified UC331 
was 5' AACAGCTATGACCATCGTGG > (SEQ ID NO:25). UC332 was identified in a study 
in which the first permiscuous primer used in the reverse transcription reaction had the sequence 
5' ACGACTCACTATGTGGAGAA 3' (SEQ ID NO:26). The second penruscuous primer that 
was used in the PCR™ fingerprinting reaction that identified UC332 was 5' 
AACAGCTATGACCCTGAGGA 3' (SEQ ID NO:27). After autoradiography, bands that 
appeared differentially in fingerprinting reactions on the pooled total RNAs described above 
were cut out of the gels and reamplified by PGR™. The reamplified PGR™ products were 
directly sequenced using the Sequenase™ reagent system (Amersham Life Sciences, Inc., 
Arlington Heights, IL.). 

The sequences of UC331 and UC332 were compared to those deposited in release 101 of 
GenBank (July 1997) using the Lasergene™ software package (DNAstar, Inc., Madison, WI). 
The DNA sequence of these cDNA fragments, when compared to the GenBank database 
revealed that the mRNAs, from which these cDNA fragments were derived, were previously 
uncharacterized. Neither UC33 T nor UC332 are genes whose products have been previously 
characterized as being significant in any physiological pathway, both UC331 and UC332 match 
sequences on the GenBank data base. 

In the case of UC331, these matches are confined to ESTs. UC331 was identical within 
the limits of sequencing accuracy to several human EST sequences. The human EST sequences 
wiAWgh^ 

^^ mRNA. Theends oftheUC331 contig were then used to requery the EST data base 
whereby rnore ESTs were identified that extended the contig. This process was continued until 
the UC331,x)ntig predicted a mRNA with an ORF and a poly^A tail. A description of the 
human ESTs that were used to construct the UG331 coptig are provided in Table 9. The 
sequence of^e UC33 1 contig and the .ORF was identified at its 5' end_ A o{ 
this contig is that the ORF extends all the way to its /oii"'^.^^^!^^^! 
rnRNA fextends further 5^ than is indicated by me c^^^ 
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TABLE 9 
Human 


' ■ ; : - . 


GB Accession Number 


Tissue 


Library 


AA403120; ' 


Total Fetus 




Soares 


.AA401845 


Total Fetus 




Soares 

i - ■ Vi . 


AA1 21473 c . , 
AA1 21262 

*.'•- v.. 


Pregnant Uterus 
Pregnant Uterus 




Soares '">• 
Soares 


R22145 1 y 4 
R22146' 
R30954' «|r 

R31006* <\mk- 

R' : l?Rft7 h - i ' - 

R31390 h 

RB7806 9 

1 iU / UUU 

R67807 9 ; vv: 


Placenta 
Placenta 
Placenta 
Placenta 
Placenta 
Placenta 
Placenta 
Placenta 


.S • - ••■ ! • ' 


Soares . > . = : ^ •• 
Soares 
Soares V 
Soares 
Soares 
- Soares ^ 
Soares 
Soares 2 ? 


AA385620 


Thyroid 




TIGR : 


W37985 ,V^/ 
W37986 . ^ ; 


Parathyroid Tumor 
Parathyroid Tumor 


' • - >VvV. ■ :: ' : -)- 


Soares 

Soares 4 .\ 


AA38040l52^ 
AA182471 :* f ' 

AA181530" ^ 

. '■•ffu-Jt.^.f, 


Cell line (Supt) 
Cell line (HeLa) 

Cell line (HeLa) 


'• ■• •"'*?• »■•" '•'• ,! \ •• • ' ' 1 '•' "•" * • ' 
• - >• • :'••}•• ' " ' • 


TIGR , , ; 
Stratagene ' ' ' •. 
IIMAGE) 1 ^ 
StratageheA v-v:i 
(IMAGE) 


' W31231 ' 


Senescent Fibroblasts 




Soares ^S? 


N22701 
W31175 
M3444B 
N34538 
• \ N36424 

N42854 ^ 
N44299 ^Mfe . 


Normal Melanocyte 

Normal Melanocyte { *V - Y *r • 
Normal Melanocyte 

Normal Melanocyte r ' ^^ixkn : 
Normal Melanocyte 

Normal Melanocyte 5^ r ' a J V 
Normal Melanocyte ''^^!j1fi||;h 
Normal Melanocyte i S^te 


Soares 

••. Soarei^^".^^ 
Soares 

Soar^^B^-A-v. 
Soares 

Soared - - ^'f- 
Soares -MW^i. 
Soares c ^ ^ 
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GB Accession Number v , Tissue 

W56398 Normal Melanocyte 

Normal Melanocyte 
Skin Tumor 



N66813 
AA37999B 



AA370040 
AA369851 

H08822 1 
H08905* v 
HI 9533 
H21379* 
H21421' 



H24360* 
H25176* 
H38689 
H38791 

H39147" 

H39148" 

H45092 c 

H45054 e 

H49928 v 

H50463 

H51403 3 

H51444* 

H528lF 



H52774.V ; . 
R85542 • 
R84652 
AA324855 
AA31721T 



AA371911; 

AA3021lff 

AA247643 
W60049 1 
W61359 71 



Prostate Gland 
Prostate Gland 

Brain (Whole infant) 
Brain (Whole infant) 
Brain (Whole Adult) 
Brain (Whole Adult) 
Brain (Whole Adult) 

Brain (Whole Adult) 
Brain (Whole Adult) 
Brain (Whole Adult) 
Brain (Whole Adult) 

Brain (Whole Adult) 
Brain (Whole Adult) 
Brain (Whole Adult) 
Brain (Whole Adult) 
Brain (Whole Adult) 
Brain (Whole Adult) 
Brain (Whole Adult) 
Brain (Whole Adult) 
Brain (Whole Adult) 

Brain (Whole Adult) 
Brain (Whole Adult) 
Brain (Whole Adult) 
Brain (Cerebellum) 
Retina 



U '-3f ' Ki v ;< 



Pituitary Gland 



S. !; ! M » .'V; /• 

Endothelial Cells, Aorta 

.-•^■•Ji'rv:!'!-. , s - r . 



Fetal Heart 
Fetal Heart 
Fetal Heart 



- ' t^M$# >' .•'■'• :t * 

• **i*V'.-5 , »a-;..- 1! - ' ' •"»# 



Library 

Soares 
Soares 
TIGR 



TIGR ; 
TIGR 

Soares 
Soares 
Soares 
Soares 
Soares 

Soares 

Soares i ^ 

Soares 

Soares 



Soares 

Soares 

Soares 
Soares 
Soares 
Soares 
Soares 
Soares 
Soares 

Soares, 

Soares 

Soares 

TIGR 

TIGR 



TIGR ■ , c: h>^ 
TIGR . "v^a># 

U. Toronto tf > 
Soares v 
Soares 
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GB Accession Number Tissue Library 



> AAZhoDJ 1 


R Polio 


* So ares 


AA234769 


Pooled; fetal heart, melanocytes, pregnant uterus 


Soares 


AA JboZoy 


rancreas 


Stratagene 


AA 150565% 


rancreas 


(IMAbh) 

otratagene , - : ; .._ ,; 






(IMAGE) 


A A IbuoJb ; . 


Pancreas 


Stratagene 






(IMAGE) 


H73822 


Fetal Liver Spleen 


Soares 


N58T80 


! F&tal Liver Spleen ' - • / 


Soares 


W04414- S 
N94254 V 


Fetal Liver Spleen ; ViH P ^ 


Soares V$; m \.' % f'*.<. 


Fetal Liver Spleen 


Soares . v ; 


N75996 


FetaJ Liver Spleen v ^ v A * 


Soares 


N69644 


Fetal Liver Spleen 


Soares • ; , 


T83329 ^ 


Fetal Liver Spleen ^ ; ^ . v : L 
Fetal liver Spleen ,• . ., v . . 


Soares v 


T72755 


Soares 


T53976 


Pooled Fetal Spleens v ^ - J «■ ^ 


Soares' 




^-•^\*^ : mui||pie ocierosis 


ooares 


N90814 


Multiple Sclerosis ? : ^ 


Soares 


N63292, ,? 


Multiple Sclerosis ^ 7 


Soares c; - 


N59233 


Multiple Sclerosis 


Soares 


N53207, ; 


Multiple Sclerosis ?ve&i 


Soares - ^ 




Multiple Sclerosis 


Soares : 


F22624 


Skeletal Muscle, .v. f - - 


CRIBtOtjilyi 



Note:Paired superscripts indicate opposite ends of the same cDNA clone. 

5 •. > ; ; Whpi^ t^rlixin^n UC3ai,^^g wa?^^^ mouse 

region 

M>*\ ^ and 
^ >; : similarity ^th each; other that j^ennitt^d, them to be assembled » iWo j a mouse J0C331 virtual 
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ponti^ in a process that was identical to that used to create the human contig. The mouse UC331 
virtual contig was also observed to have an ORF at its 5' end and a poly-A tail at its 3' end. A 
description of the mouse ESTs that were used to construct this contig are provided in Table 1 0. 



TABLE 10 
Mouse 
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GB Accession Number 

AA0274a7 - 
AA023708 • ^ 
AA023154 T 
AA024303 

W35948 
W 11581 
W36820 
AA002492 
AA097370 ^ 
AA014313; : 
AA450512 
AA408179 1 
AA40826T 1 y 

AA117174 ; 
AA11934&> 
AA183195 ^ 

AA122933^ 

AA42361 3 



Tissue 

Placenta 

Placenta 

Placenta 

Placenta 

Total Fetus 

Total Fetus 

Total Fetus 

Mouse Embryo 

Mouse Embryo 

Mouse Embryo ?f 

Beddington embryonic region 

Embryo fetoplacental Cone 

Embryo Fetoplacental Cone 

T-cells 
Thymus 
Lymph Node 



...SI-'*-: >■;• 



Kidney 

Mammary Gland 



Library 


Clone # ) 


Soares 


459407|5 f ) 


OUcJlt;i> 


456984(5') 


Soares 


456027 (5') 


Soares 


458313 (5') 


Soares 


350258(5') 


Soares 


318665(5?) 


: opares 


336707 (5) 


Soares 


426498(5') 


Soares 


493073(5') 


Soares 


468491 (5') 


IMAGE 


865186(50* 


Ko 


C0025F09 (3') 


Ko 


C0025F09(5') 


Stratagene 


558134(51 


Spares 
Soares 


573567(5') 
636222(5') 


#ittad 


579415 (51 :: ' 


Soares 


832219 (51 



Note-Paired superscripts indicate opposite ends of the same cDNA clone. " 

When the MegAlign™ program of the Lasergene™ DNA analysis software package 

contigs were 

predid^ 

^ieir lengths. This similarity was most striking in the regftm comprising the ptifetiVe ORFs. 
Within the CfRFs the mouse and human contigs, the l)NA sequences are 89% iderifical. In the 
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predicted 3' untranslated regions of the two contigs, the DNA sequence similarity falls to 73% 
with several small deletions and insertions. This higher degree of sequence similarity in the 
putative ORFs as compared to the proposed 3' untranslated regidh is interpreted as evidence that 
the ORFs encode proteins on which natural selection constrains amino acid sequence divergence. 
5 Like the human U C33 1 contig, the mouse contig also encodes a putative ORFthat extends all the 
way to its 5' end. This provides additional support for the contention that the UC331 mRNA 
contains more sequences at its 5' end than are represented by the EST based contigs presented 
.here. , ■"■■■.»*■.."? 

The ORF^ of the mouse and human UC33 1 contigs Were conceptually translated and the 

10 amino acid sequences were compared: The amino acid seiqtiehce of the human UC331 ORF was 
used to query the Swiss, PIR and Translation release 101 using the Laserg^he™ software 
package. For the 1 57 amino acids for \Vhich- this comparison; is possible, the riiouse and human 
sequences are collinear and identical at 15 1 ; positions (96%) with five of the six differences being 
conservative substitutions. This putative protein domain is highly acidic with 26 acidic and 17 

15 basic amino acids. There were also 48 hydrophobic and 43 polar amino acids predicted. When 
either the predicted mouse or human UC331 amino acid sequenced was compared to amino acid 
sequences in the ^public proteji^ sequence data bases, no Significant matches were found to any 
previously characterized vertebrate proteins. However, a significant match was observed to a 
putative protein, termed ZK353.1 (PIR Accession n^ the genome of 

20 the nematode, Caenorhqbditis, elagans. /The^ mammalian arnirio acid sequence is similar and 
collinear with the, C-tenm^ 157 amino acids of the putative C. eUgdns f>Votein. Like the 
■ mammalian UC33X amino .acid sequences* tj^ acid seqiieiice of the 

ZK353. 1 is also highly acidic with 31 -acidic and only 20 basic amino acids. Over the 203 amino 
acids for which a comparison can be made the*ZK353.1 amino afcid sequencer is identical to the 

25 human or mopse sequence at 84 (41%); positions with ariany of the diffeirertces representing 
conservative substitutions* : x ^ • • = ;• ' 5 c 

• •• •»•' . ■ ^ ;. < ' ; • • • . .- . • 

^ The putatiy^ C^etegari? protein, Z^53A;hs& ltd ciin^hlly fitfictidn. Its i&astence 

is predicted froniv the Q elegans genome sequencing effort (Sulstoh ef afe 9 1992). The 
polypeptide; sequence for Z3^^3,l on the' C. elegans 

30 chix>mosome ni (GB accession, iiumber ^ is 



wonn-ms 

106 KnVI)S)>7/22105 
548 amino acids long and ilKludes m ...... 

it ^ ^ ^ that ^™ - - ^ prolein ovct te 

*c possible panen,. of Wprooessfag ^ ^ p - — * «-n«*«; 

° ^"^^-^ml^and™ 

T Q confirm that the human UC33 1 • 
Vwjz did not match anv pqt 

.^^^-^ 

Wlh a GenBank accession number of M7451. * 
Frequently, „ lRNAs ideillil i e(1 fc 

"* * being 

>^ Li, 

P°»ls of cDNA consul by ,' ' " ^ of ^ ^ exammed nonnalized 

"Trent metastatic Breast cancer was also 
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examined. The inclusion of the breast c^cer patient samples in this study was made to 
determine if the mRNAs examibed were being differentially regulated in the iiriMune system in a 
response that was specific for prostate cancer or if the response was more general to metastatic 
cancer in general. Using these pools of cblvfA as templates* triplicate PCR™ was performed. 
5 Each 6f the three replicates were terminated at a different cycle number of PGR™. This format 
? of relati ve quantitative RT-PCR™ insures that the results taken for relative quantitation represent 
f ■• the PGks™ when they are in the log linear portions of their ^plifieafion curves where such 
quantitation is most accurate, 
ft Approximately 1.5-5.0 \xg of E)NA-f¥ee total RNA from the peripheral blood of healthy 

10 ^volunteers or patients with either metastatic prostate of breast Cancer were converted into first 
strdhd cDNA using the Superscript™ Preampiificatibn System for Firsrt: Strand SDN A Synthesis 
(GIBCO-iBRL, 18089-01 iy by the supplier. These 

cDNAs were then normalized to contain equal concehtratibns of airiplifiable cDNA by PCR™ 
amplification of P-^ctin cDlsfA using the primers 5' GGAGCTGCCTGACGGCCAGGTCATC 3' 
15 (SEO ID NO:28) and 5< ^A^CAT^^ 3' (SEQ ID NO:9). A 

typical PCR™ program would be 94°C for 1 : 15 min, followed by 22 cycles of 94°C for 45 sec, 
55 P C for 45 seb-and 72 6 C for 1 :15 min. this was followed by JBrial extension of 72°C for 5:00 
PCR™ products werie visualized by gel electrophoresis through 1.5% agarose TAE gels 
stained with ethidiuhi bromide. Images 6f ? fhe gels were captured, digitized^ and analyzed using 
20 the IS-TOOO Digital Imaging System (Alpha Ihndtech Cc^)- TOS ^cbiibentetiojis of the cDNM 
were adjusted by adding various amounts df water to create cDNA stocks that contained equal 
concentrations of amplifiable 0-actin ct>NA Typically, ihe \c0NA derived from the reverse 
Kvti^seriptioii 6f 510 jig of RNA resulted in enotigh riotmalized cDN A to perform 50-200 
■ - " APG^ reactiohs^ • - ^ - ■ " ' V y^y- ■■■ ■; ' ^ • 

25 ^ Equal amounts of the normalized cDN A Stock fi^M individuals having the same disease 

^ staX6 wdre pooled. Pbols of cDNA^^^ 

" caiicier and metastatic bteast cancer were pibduc&tL Thes6 pools were then exaftrinbti by PCR™ 
for p-actin to determine that they contained equal ^mbunti df ^amplifiable cDNAi - ! ' 

^ Tb demonstrate that all ob^rvations weife ma^ Jog-lihear phase of the PCR™ 

30' amplification curve, a series of PCR™ reactions -nsiiig different cycle htimber were performed ott, 
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each cDNA pool for each gene (primer pair) examined. Display of the PCR™ products oh 
electrophoretic gels and analysis with the IS 1000 Digital Imaging System illustrates that the 
mass of the PCR™ products is increased exponentially with increasing cycle number, confirming 
that the observed results are in the log-linear portion of the PCR™ amplification curve. 

Relative quantitative RT-PCR™ showing near equal amounts of amplifiable p-actin 
cDNA in three pools cDNA. Pools of normalized cDNAs were constructed from peripheral 
blood RNAs from eight healthy volunteers, ten individuals with recurrent metastatic prostate 
cancer, or ten individuals with recurrent metastatic breast cancer. Three separate PCR™ 
reactions were performed on each pool of cDNA. PCR™ was terminated at differing cycle 
numbers (cycle 22, cycle 24, and cycle 26), and the products were visualized by electrophoreses 
and ethidiumn bromide staining. Images were captured and quantitated using a digital image 
analysis system. At all three cycle numbers examined, there are relatively similar band 
intensities representing the three cDNA pools and increasing band intensity with increasing cycle 
number, verifying that the observations are being made in the log linear range of the 
15 amplification curves. Similar band intensities indicate similar relative concentrations of P-actih 
mRNA in the RNAs from individuals from which these cDNA pools were constructed. 

The oligonucleotides used in the relative quantitative RT-PCR™ studies that 
independently confirmed the differential expression of UC33 1 were designed from the .sequence 
in the human UC33 1 virtual contig. These UC331 specific oligonucleotides had the sequences of 



10 



20 1 5 ' V 1 VrtJKJCl ACGGAAGATACGACAC, • 3' (SEQ ID ; NO:31) . and 5' 
ACAATCCGGAGGCATCAQAAACT 3'. (SEQ ID NO:32). These oligonucleotides direct the 
; « - amplification of a 27^ nucleotide long PCR™ product that t ^4|w«^^.UC331. The 
r% oligonucleotides used in the relative quantitative RT-PCR™ studies that independently 
confirmed the differential expression of UC332 were designed using the sequences of the cDNA 
25. with the GenBank accession number D&7451 . These UC332 specific oligonucleotides had the 
,^J) : \sm^^' ^ AGCCCCGGCCTCCTCGTCCTC . 3' (SEQ ID NQ:33) and 5' 
i aG^GGCGGCAGCGGTOTC 3* (SEQ ID NO:34>. These oligonucleotides .direct the 
amplification of a 1 40 nucleotide long PCR™ product that is -specific for f UC332. 
v ; : The] results for relative.level& pf p^-actin expressi those observed 

30 when oligonucleotide primers specificTor UG331 were used to direct PQR™ amplification (FIG, : 
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' r 7)1 At 25 cycles of PCR™, clear bands are visible in the lanes representing the pools of cDNA 
from peripheral blood of patients with either metastatic breast or prostate cancer. In the lane 
representing the peripheral blood of healthy volunteers, only a very faint band is present At 28 
cycles of PCR™, the band intensities representing all three 'pools are brighter than they were at 
5 cycles, but Che relativ e increase in intensity of the bands representing the metastatic cancer 

%ati6nt pools compared to the healthy volunteers remains the same as was observed at 25 cycles 
eff PGR™. This indicates that these observations are being made in the log linear range of the 
PGR™ amplification curv es. At 31 cycles of PGR™, there is still an increase in the intensity of 
^the feands representing the pools of metastatic cancer patients compared to the pool representing 
10 ^Qfie healthy volunteers,; but a quantitati ve analysis of these bands indicates that the PCRs™ have 
; left the log linear range oiF their amplification curves. Quantitation of the data for 25 and 28 
cycles of PCR™ independently confirms that UC33 1 mRNA is differentially regulated and is 
roughly seven fold more abundant in the peripheral blood leukocytes of the average patient with 
either recurrent metastatic prostate cancer or breast cancer than in the peripheral blood 
15 leukocytes of healthy Volunteers. ; 

The second foriiiat of relative quahtitative RT-PCR 1 ^ used to examine the differential 
expression of TJC331 exaniined the relative abundance of UG33 1 mRNA in the peripheral blood 
of healthy iridividiiais or individuals with r^urrent metaktoic Ctocer. The indi viduals 1 examined 
in this study Were the same as those whose cDNAs were combined to construct the pools 
20 examiifed ^ desml^ed Sbdve. Using the iiifbrinatioti obtained from the pooled cDlSTA study to 
predict at what "PCR™ cycle numbers relative quantitative RT-RCR™ would be most 
irtfoilhative; these individuals were 5 ex2Utiined for the relative abundance of p-actiri- amd UC331 
- "%LFtNAs present iii their peripheral blood leukocytes. £CR™ was for 22 cycles.' A3l individuals 
^ ex^ined cciiitaiii roughly fequhl amounts of aiftplifiable ,p-actih cE>NA. Some 6f th6 differences 
25 ' m ^actiii band intensity obiseirved in this study are probably dueto the internal variatabrf inherent 
^ilnis stody^ Results firorft studied esighed to "xjuaiitit^e ^5 intefhal variation indicate that 
%ienti6al r^Jifcatesr of a p^tin PCfc;™ can be ebcj>e<^<^ bands 
r ^ wittfa ^d^d deviation of ±15 \ " < ^ 

? V* ReiatiVfe qte^^ 4 bf cWiA v^ conducted using reverse 

30 RNA isolated froni ^ pferiph^al blood of eight healthy V6lufe^(iroup % 
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ten individuals with recurrent methstatic prostate cancer (group P), or ten individuals with 
recurrent metastatic breast cancer (group B). PCR™ was for 30 cycles. As was seen in the study 
using the pooled cDNAs, the results of the relative quantitative RT-PCR™ for UC33 , ^ 
cDNA from individuals contrasts sharply with that observed for p-actin. The intensity of the 
band representing the abundance of the UC331 mRNA in peripheral blood leukocytes was 
greater for all of the patients with either metastatic prostate or breast cancer as compared to the 
intensity of the UC331 band representing the mRNA level in the peripheral blood leukocytes of 
healthy volunteers. Therefore, the elevated UC331 mRNA levels indicated by the relative 
quantitative RT-PCR™ results using the pooled cDNA templates was caused by an elevated 
mRNA level in all individuals comprising the pools and not from a subset of individuals with 
very high elevations in UC331 mRNA levels. This study is a second independent confirmation 
of the differential expression of the UG331 mRNA. 

As is indicated by the wide distribution of tissues from which the ESTs used to assemble 
the UC331 contigs (Table 9), UC331 is widely expressed in many tissue and cell types. 
However, because most of ESTs comprising UC331 are from normalized libraries, little 
information can be gained from this data on the relative abundance of the UC331 mRNA in 
different tissues. Also, while the extension of the QRPs of the mouse and human UC331 contigs 
all the way to their 5' ends and the similarity of mammalian UC331 mRNAs to a much larger 
putative C. elegans mRNA both predict that the mammalian UC331 mRNA extends even further 
5', the exact size of foe UC331 mRNA was unknown. To address all of these issues, aNorfoern 
blot of poly-A pjus RNA from eight different human tissues was probed with the 850 nucleotide 
long RT-PCR™ product described abqye labeled with ^P, Approximately 2.0 tig of poly-A plus 
RNA from spleen, mymus, prostate, testis, ovary, small intestine, colon, and peripheral blood 
leukocytes were loaded in each lane. UC331 mRNA is expressed in all ei^t human tissue and 

a message size of approximately 1.75 ^. Interestingly, 
bI 99^ l^Pcy^es but Js higWy the thymus, 

demonstmtmg a difference^ of different devefopmentajl ^ges in the 

immune system. UC331 is most abundantly expressed in the te^. The 1QC3? 1. mRNA is about ,' 
1-75 Jcb which indicated that the mRNA only extends about 350jiuclro 5' than is 

, accouirted for Jby foe ^ja^^i^-tn^ <^f^\ 
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virtual contig is shown in SEQ ID NO:30. Cleatrly, the: putative C. elegarismRNA extends much 
• more 5' than do the mammdian mRNA species; 

The other jgehe identified sis being differentially regulated in this RNA fingerprinting 
sttiidy was UC332. UC332 was analyzed in riiuch the same way as UC331 was. When the 
5 sequence of the cDNA fragment from the RNA fingerprinting gel representing UC332 was used 
- tb query GenBank, no ESTs were identified; The sequence of the UC332 cDNA fragment did; 
r however, identify a sequence of a full length cDNA, KA000262 (GB:accession number 

: * 1387451). The sequence of KA000262, (hereafter referred to interchangeably with the name, 
T was determined as part of a project to examine previously unidentified mRNAs 

10 expressed in the bone marrow myeloblast cell line; KG-1 (Nagase et aL 9 1996). This mRNA 
Contains an ORF encoding a putative protein with 761 amino &cid sequence. Perhaps the most 
striking feature of this polypeptide sequence is the appearance of -a C3HG4 RING zinc finger or 
RIJsTG finger motif (Freetiiont; 1993) lodated between amino acids 175 and 216. The RING 

■ finger domain binds twd zinc ions in a conserved structure that has been resolved (Barlow et aL> 
15 1 994). RING finger doitiains have beerif identified in dozens of proteins derived from eukai^otes 

as diverse as yeasts, flies, birds, nematodes arid humans. In most of these cases, the RING finger 
containing proteins have been shown to be essehtial for some important biological process 
although the these processes vary considerably one from another^ Among these mammalian 
enboded RING finger proteins afe several genes implicated iii the ontogeny of cancer including 
2© Ihe ret vir&l oncogene (Takahashi et al , £98 8) andT>mM, a gene whose product collaborates 
wfth-hiyc iiidueed transformation (Haujrt et al , 1 99 1 ). • The BRCA-1 tumor suppressor gene 
- r involved in hereditary breast arid ov^ari cSShcfer susceptibility contains a RING fingeif domain 
^ (Mila it 1994), arid MAT- 1, a riovgl 36 kDa RING fihger prikeiii, is teqti&ed for the 
r> assembly of ? enayriiatically active CDK7- cychn H complexes (Tassan et aL 9 1995). A 
25 comparison of the RING fitiger domains of UG332 and various re|>^entefive ? mehibe^ of this 
• ^ ^ indicates the 

■ H ■■^:..£ i,: ^a3Sj5 finger d^ih^ of UC332 i^ tho$e found in the suppressor 
> ^OXj^^ Of tr^scri]ptiori prWeiii, ipt-1. H6weVer, BRCA1 and 
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Proteins with RING finger motifs exhibit heterogeneity in their subcelluar localizations. 
Some, that arc important regulators of differential gene regulation, localize to the cell nucleus 
When the amino acid sequence of UC332 was scanned for evidence of subcellular localization 
two domains were identified that contained sequences for putative nuclear localization signals' 
5 (NLS). NLS are highly basic, stretches of six are more amino acids of which at least four are 
basic thauend to be flanked by acidic amino acids and/or prolines (Boulikas, 1994). Both of the 
putative NLS in UC332 longer and more basic than the minimum requirements for the^nsensus 
NL$ motif. The first of these putative NLS motifs occurs between amino acid 548 and 567. 
Within this domain, 13 of 19 amino acids arc basic. In fact, this domain could be viewed as two 
10 NLS ^ tandem separated by two glutamic acid residues, If divided this way, the first NLS 
domain would Jiave \pf eleven positions as basic amino acids while the second motif would 
: have 5 of 6 amino acids being basic. The second NLS motif in UC332 is located near the 
C-terminal end between positions 739 and 750 in the amino acid sequence. This domain has 8 of 
12 amino acids as basic, residues, with a core of 5 consecutive lysines and arginines. The 
15 presence of these putative NLS in the amino acid sequence ofUC332 suggest the possibility that 
UC332 plays an important role in regulating the expression of other genes. Finally, the amino 
acid sequence of UC332 lacks a signal sequence for cellular export or an obvious hydrophobic 
transmembrane- domains. : <,,... 

^ ? ^ • To independently verify that UC332 mRNA is more abundant in the peripheral blood 
20 , i ^ five peripheral blood 

leukocytes of healthy volunteers, .relative quantitative RT-fCR^ w^ the ^ 

™$ formats as were used to investigate the differential regulation of UC33 1 A relative 
quantitative RT^CR™ ; study using UG332 specific oligonucleotide primers and cDNA pools as 
^mplates wa^ conducted. , At 25 and 28 cycles of PGR™, the amplified DNA band representing 
25 o &e relatiye abun^ UC332 mRNA is stamed more intensely for those, reactions that 

^rJ^^^m^t^^ ppolsxpnstnrcted from the peripheral blood Jeukpcyte RNA isolated from 

System (Alpha Innotech, Inc.) mdicates mat UG332 : 
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the peripheral blood leukocytes of metastatic cancer patients compared to healthy volunteers. At 
31 cycles of PCR™, the reactions have left the log linear range of their amplification curves. 

In a second relative quantitative RT-PGR™ study using UC332 specific oligonucleotide 
prirners, peripheral blobd leukocyte cDNA from the individuals that comprised the pools from 
the peripheral blood of eight healthy volunteers, ten individuals with recurrent metastatic prostate 
cancer, or ten mdividiials with recurrent metastatic breast cancer were examined separately. 
PCR™ was for 26 cycles: TTie results of this study are similar to those obtained when the pooled 
cDNAs were used as PCR™ templates. All of the cancer patients had higher levels of UC332 
mRNA in their peripheral blood leukocytes than did any of the healthy volunteers. 

In this study, the inventors showed that UC332, encoding a RING finger protein, is iip 
regulated in the peripheral blood leukocytes of patients with either recurrent metastatic breast or 
prostate cancer. From the literature, RING finger proteins have been shown to participate in the 
regulation of several important lymphocytic processes (Patarca et al, 1988; Fridell et al\ 1995; 
Takeuchi et al. 9 1996; van Arsdale et aU ? 1997; Nakano et al y 1996). The observed differential 
regulation of the RING protein encoding mRNA, UC332, in the immune response of patients 
with metastatic breast or prostate cancer, strongly suggests that UC332 participates in regulating 
this immune response. 

All of the compositions and methods disclosed and claimed herein may be made and 
executed without undue experimentation in light of the present disclosure. While the compositions 
and methods of this disclosure have been described in terms of preferred embodiments, it is 
apparent that variations may be applied to the composition, methods and in the steps or in the 
sequence of steps of the method described herein without departing from the concept, spirit and 
scope of the invention. 

More specifically, it is apparent that certain agents which are both chemically and 
physiologically related may be substituted for the agents described herein while the same or similar 
results would be achieved. All such similar substitutes and modifications apparent to those skilled 
in the art are deemed to be within the spirit, scope and concept of the disclosure as defined by the 
appended claims. ' 

UC325-1 is derived from the IL-8 gene (Genebank Accession #M28130). UC325-1 and 
UC325-2, an alternatively spliced form that includes the third intron of the IL-8 primary transcript. 
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are transcribed from the IL-8 gene. Our definition, of IL-8 gene products means all mRNAs 
transcribed from the IL-8 gene, the polypeptide encoded by those mRNAs and their post- 
transiationaJly processed protein products. 

Those practiced in the art will realize that there exists naturally occurring genetic 
variation between individuals. As a result, some individuals may synthesize IL-8 gene products 
that differ from those described by the sequences entailed in the.GenebatJc number listed above 
We include m ,u,r definition of IL-8, those products encoded by IL-8 genes that vary in sequence 
from tW d^ribcd above. Those practiced in the art will realize that modest variations in DNA 
sequence w.II no, Mgnificantly obscure the identity of a gene product as being derived from the 
IL-8 gene. n 
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SEQUENCE LISTING 

- . . . ' i . -» 

% ( 1) GENERAL INFORMATION : I ..:.'=. 

li) APPLI^A^T: ' J - : - • • • T . J-... (j-.. / ■ 

(A) NAME; UROCOR, Inc. < 

(B) STREET z 800 Research Parkway ^ 

(C) CITY: Oklahoma City 

(D) STATE : Oklahoma ^ ° > ' 

(E) COUNTRY; USA 

(F) POSTAL CODE (ZIP) : 73104 

(ii) TITLE OF INVENTION: DIAGNOSIS OF DISEASE STATE USING mRNA 
PROFILES 

(iii) NUMBER OF SEQUENCES: 34 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patent In Release #1.0 , Version #1.30 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 253 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GGCAGGGGCT TGTGACTCTA AGATGGCTTC ATTCACATGC CTAGGGCCTC AGTAGGATGA €0 

CTGGCATGGC CCTGGAAAAC TGCGAAGTCT TCTCTCTGTG C^CTTTCA CCTGGACTTT 120 

TTATATGATT CTGGAAGTAT TCCAAGAAGG CAAAAGTAAA AACTGCAAAG CGTCTTAAAA 180 

TAGAAGTTCA GAAGCCACAT TATATCACTT CTGTTGCATT CTATCAAAGC AAGTCACAAG 240 
CCCCTGCCAA TCA 



253 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(r>) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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CACACACTCC CCCATTCTGA GCCCCAAGAG GCTCATCCCT AAGGATGTCC AGAGATCCAA 60 

GTGGAGAAGG AGAATGTGGT GAGGCTATTT ATTCCCCCAG TGCCTTCCCT GCTGGGCTAT 120 

GGATGAACAG TGGCTGACTT CATCTAGGAA AGAGCTATGG CTTCTGTCTC CTGGAGCTCA 180 

CCA 183 

(2) INFORMATION FOR SEQ ID NO: 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
•(D) TOPOLOGY: linear. 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3 : 

GGTGAGCCCC -AGGAGAGAGA AGAG ATATGA GGAAATTGTT AAGGAAGTCA GCACTTACAT 60 

TAAGAAAATT GGCTAGAACC CCGACACAGT AGCATTTGTG CCAATTTCTG GTTGGAATGG 120 

TGACAAGATG GTGGAGCCAA GTGCTAACAT GCCTTGGTTC AAGGGATQGA AAGTCAGCCG 180 

TAAGGATGGG AATGCCAGTG GAACCACGCT GCTTGAGGCT CTGGACTGCA TCCTACCACC ^ 240 

AACTCGTCCA ACTGACAAGC CCTTGCGCCT GCCTCTCCAA GGATGTTCTT ACAAAATTGG 30O 

TGGTATTGGT ACTGTTCCCT GTTTGGCCGA ATTGGAAAAC TGGTGTTCCT CCAAACCCCG 360 

GTTATGGTGG GTTTCCTCGT CCTTGGA • 38? 

(2) INFORMATION FOR SEQ ID NO: 4: / <■ 

(i) SEQUENCE CHARACTERISTICS: .... "... ■ 

(A) LENGTH: 366 base pairs \ ■ \f'K : -]\ >.[.']':*'■'■' " ' 

(B) TYPE: nucleic acid i >: - - ' -a 1 - ^ : • 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear / r V*'~'/?\\KC^V 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4;: 

GGGCGGAACA AGGGAGCGCT AAAAGGAAAT TAGGATGTGA GG^GGATAAA GGAACAyAAT SO 
TCCAAAACCT TTCCAAACCC CAAATTTATT GAAAGGAACT GAGGAGTGGA TTGAGGAGTG 12 Q 

GACCAACACT GGCGCCAAAC ACAGAAATTA TTGTAE^CT^ ^ -180 
GTCTGGGCCC CAAGGAAAAC TGGGTGCAGA GGGTTTGT*^ 240 
AGAATTCATA AAAAAATTCA TTCTCTGTGG TATCC3&fcf^ $(^GT^AAGji TCCOAGTGAA 300 
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ACTTCAAGCA AATCTACTTC AACACTTCAT GTATTGTGTG GGTCTGTTGT AGGGTTGCCA 360 
GTTGTT 

366 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 598 base pairs 
CB) TYPE: nucleic acid 
(C) STRAND EDNESS : single 
V (D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: • V 

GCTTGGGCCC CAAGGAAAAC TGGGTGCAGA GGGTTGTGGA G^GTITTTG AAGAGGTAAG 60 
TTATATATTT TTGAATTTAA AATTTGTCAT TTATCCGTGA GACATATAAT CCAAAGTCAG 12 0 

CCTATAAATt TCTTTCTGTT GCTAAAAATC GTCATTAGGT ATCTCCCTTT TTGGTTAAAA 180 
AAAAAAGGAA TAGCATCAAT AGTGAGTGTG TTGTACTCAT GACCAGAAAG ACCATACATA 24 0 

GTTTGCCCAG GAAATTCTGG GTTTAAGCTT GTGTCCTATA CTCTTAGTAA AGTTCTTTGT 300 
CACTCCCAGT AGTGTCCTAT GTTAGATGAT AATGTCTTTG ATCTCCCTAT TTATAGTTGA 360 
GAATATAGAG CATGTCTAAC ACATGAATGT CAAAGACTAT ATTGACTTTT CAAGAACCCT 420 
ACTTTCCTTC TTATTAAACA TAGCTCATCT TTATATTGTG AATTTTATTT TAGGGCTGAG 480 
AATTCATAAA AAAATTCATT CTCTGTGGTA TCCAAGAATC AGTGAAGATG CCAGTGAAAC • •'■>' 540 
TTCAAGCAAA TCTACTTCAA CACTTCATGT ATTGTGTGGG TCTG1TCTAG GGTTGCGA S9& 

(2) INFORMATION FOR SEQ ID NO: 6: ^ 

(i) SEQUENCE CHARACTERISTICS: V^-::'?*?^^^ "■' r *' 

(A) LENGTH: 22 base pairs - : ^ 

(B> TYPE: nucleic acid r v "V ^ 

(C) STRANDEDNESS : single 7v 
CD) TOPOLOGY: linear 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CGCCTCAGGC TOGGGCAGCA irt \ ' ' :V ^X:^'^^)^'- ^-'^^^^i^ 22 " 



(2 ) INFORMATION FOR SEQ ID NO : .7 1 • ■ 

l±) S^QUENfeg CHAMCT^ISTI CS : - - -, 
(A) LENGTH: 25 base pairs 
13) TYPE: nucleic acid. 
(C) STOANDEDNEiSS : single 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO r 7 : 
ACAGTGGAAG AGTCTCATTC GAGAT 25 

(2) INFORMATION FOR SEQ ID NO: 8: J 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs. 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single : ^ r 

(D) TOPOLOGY: linear 

- (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 8: 
CGAGCTGCCT GACGGCCAGG TCATC • , >V 25 

(2) INFORMATION FOR SEQ ID NO : 9: ^ ^ ^ : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs ' 

(B) TYPE: nucleic acid . ? 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GAAGCATTTG CGGTGGACGA TGGAG . - 25 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : : ^p^V .; : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid ';■ ^ : ..• 

(C) STRANDEDNESS : single' \, "' "^^^ 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TGCAAACTTT CACCTGGACT T ' L /*C>- : -.; *. v v 21 

(2) INFORMATION FOR SEQ ID NO : 11 : ^ v " 

(i) SEQUENCE CHARACTERISTICS : : \^ f;: 

(A) LENGTH: 24 base pairs "±a^t';£ .;' ; / ' ■ 

(B) TYPE : nucleic acid , . . 

(C) . STRANDEDNESS : single r . V. ' 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
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CTTGTGACTT GCTTTGATAG AATG 

24 

(2) INFORMATION FOR SEQ IB NOr 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear y 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 : 

GACAACATGC TGGAGCCAAG TGC 

23 

r' ... ... .. ■-. - "■■."* 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY :• linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID. NO : 13: 
ACCACCAATT TTGTAAGAAC ATCCT 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NQ: 2,4 t : 
TGTCCAGAGA TCCAAGTGCA GAAGG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs ,. \ ' 

(B) TYPE: nucleic acid v.. : V:V : . ?/>V' 
(G) STRANDEDNESS: single ^ 

(D) TOPOLOGY: linear ' " ' / " - "■ : \""' V -- "'"""'^ ■ " ' 

(Xi) SEQUENCE DESCRIPTION: SEQ ID 15 -V" :'J ^rf;'^-^ '" ' " 

GAGCTCCAGG AGACAGAAGC CATAG : , 



25 



25 



. 25 
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(2) INFORMATION FOR SEQ ID NO: 16: v 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid : «;-.■- 

(C) STRANDEDNESS : single - - " 

(D) TOPOLOGY: linear : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGGCCCCAAG GAAAACT 17 

(2) INFORMATION FOR SEQ ID NO: 17: ' ; 

(i) SEQUENCE CHARACTERISTICS : ; !iv : : 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic; acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

TGGCAACCCT ACAACAGAC V ' - ■ " " 1 - 19 

* 

(2) INFORMATION FOR SEQ ID NO: 18: - * 

(i) SEQUENCE CHARACTERISTICS : ^ ; ; V • .- 'v-V V ^ 

(A) LENGTH r 17 base pairs * 

(B) TYPE: nucleic acid : 

(C> STRANDEDNESS: single >S 

(D) TOPOLOGY: linear i ^ • - V 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GGGCCCCAAG GAAAACT / ' V N S P HP-^'; : - > *i 7 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH- 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : kindle 3 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



. .. . 



TGGCAACCCT ACAACAGACC v : ^ ? ;-v V "20 

(2) INFORMATION FOR SEQ ID NO: 20: -'"^^ ; -: v*~». OC; ' 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 20 
ACATTGAAGC ACTCCGCGAC 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 21 : 
AGAGTGGCAG CAACCAAGCT 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GCCTCAGGCT GGGGCAGCAT T 

(2) INFORMATION FOR SEQ ID NO: 23: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GGTCACCTTC TGAGGGTGAA CTTGC 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single ■ 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ACGACTCACT ATAAGCAGGA - '\ • 20 

(2) INFORMATION FOR SEQ ID NO: 25: ; >1 ^ 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B ) TYPE : nu c 1 e i c a eid . ... , v Y ! , v - 

(C) STRANDEDNESS : single '/:,V#:.vi'v 

(D) TOPOLOGY: linear ,. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: , 
AACAGCTATGf ACCATCGTGG , . . ::■ .... 20 

(2) INFORMATION FOR SEQ ID NO 26 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base; pairs < 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS s single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION, SEQ ID NO: 26 t 
ACGACTCACT ATGTGGAGAA v " 20 

(2) INFORMATION FOR SEQ ID NO s 27s ; 



(i) ..SEQpp^NCT;, .OGVRACTfiRISTICS: . 
(A) LENGTH s 2 6 base pairs 
; " ; (B) :TYPfe : nucleic acid j - 
(C) STRANDEDNESS: single 
V- (D) TOPOI^)OT : linear : • . , v ' , — ; . 

(xi) SEQUENCE DESCRIPtioN: SE& ID NO: 27: - ; '■' • - > 

: r AACAGCTATG ACCCTGAGGA . , , 20 



(2) INFORMATION Pt>K SEQ ID NO: 28: ■ 

(i) SEQtfeNGE CHARACI^RIS*TICS s */ 1 

^) ?t»ENdiTH? 25 1 Das^ai^fs -V"-- * - r ""* - V ' V \>-:^ : 

(B) TYPE: rnicleic acid . ^ 

,.,^/(C) l . : .S'pM^EPNES.S.: ..single , ...\.V^ v.> . .. y 

(D) TOPOLOGY ; linear * " ^ v / \ - J>- - ° 
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55 lr?^^cWH^ : %- ; - 1 65 

TTT GTT GAA AGT GAT GCA GAT GAA GAG CTT CTG TTT AAT ATI CCA TTT 
Pne Val Glu Ser Asp Ala Asp Glu Glu Leu Leu Phe Asn He Pro phe 

7o ..- ■ - 75 80 

t2 Sv ISS? 'f* 0 AAAGGCATCOT ATA ATC GGA GAG ' GAT GAT 

Thr Gly Asn Val Lys Leu Lys Gly He He He Met Gly Glu Asp Asp 

GAC TCA CAC CCC TCT GAG ATG AGA CTG TAC AAG • AAT ATT CCA CAG. ATC3 4 » 
Asp S*r His Pro Ser Glu Met Arg Leu ^M^M^*^' 



105 . 



TCC TTT GAT GAT ACA GAA AGG GAG CCA GAT. CAG ACC A^T' CTG AAC 

Ser Phe Asp Asp Thr Glu Arg Glu Pro Asp Gin Thr Pne^Sel Leu W 



25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GGAGCTGCCT GACGGCCAGG TCATC ' 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single ~ 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 115. .744 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 29: 
GCGGCAGGCG CGGCAAATTA CGTTGCCGGA GCTGAACGGC GCGGCTGGTC TGAAGGCAAA 

CAAGCGAGCG AGCGCGCGAT AGGGGCCGAG AGGACGCGCA GGTGGCGGCG TTGC ATG 

Met 
1 

TCG CAC GGT CAC AGC CAC GGA ATG GGT GAC TGC CGC TGC GCC GCC GAA 
Ser Hxs Gly His Ser His Gly Met Gly Asp Cys Arg Cys Ala S Glu 
5 10 is 

^ GAG 6AG CCG CCC GAG CAG CAC GCC ATG GCT ACG CTG TAC CTG CGC 
Arg Glu Glu pro Pro Glu Gin His Ala Met Ala Thr Leu Tyr Leu Arg 

A lS ^ ™ ~ ^ ™ TGC "5" ^ GAG AGC CGC GAG GGC AGC 2*1 

Ser 
45 



(SO 
117 

IS 5 

213 



Tlo Tic r IV AW tir GAG AGC CGC GAG GGC AGC 

He Asp Leu Glu Arg Leu Gin Cys Leu Asn Glu Ser Arg Gliv Gly Ser 

3 * 40 • 45 " — ' ^ 

GGC CGC CGCGTCTTCMG CCG TGG GAG (^CdG 'i^^^^^^^ 3 A 9 
Gly Arg Gly Val Phe Lys Pro Trp Glu GlU^ ^W^jS* 



357 



405 



501 
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115 120 125 

CGG GAT CTT ACA GGA GAA TTA GAG TAT GCT ACA AAA ATT TCTj CGT TTT 549 
Arg Asp Leu Thr Gly Glu Leu Glu Tyr Ala Thr Lys lie Ser Arg Phe 
130 135 140. / 145 

TCA AAT GTC TAT CAT CTC TCA ATT CAT ATT TCA AAA AAC TTC GGA GCA 597 
Ser Asn Val Tyr His Leu Ser lie His lie Ser Lys Asn Phe jGly Ala 
ISO 155 ±60 

GAT ACG ACA AAG GTC TTT TAT ATT GGC CTG AGA GGA GAG TGG ACT GAG 645 
Asp Thr Thr L^s Val Phe Tyr lie Gly Leu Arg Gly Glu Trp Thr Glu 

165 170 175 1, 

CTT CGC CGA CAC GAG GTG ACC ATC TGC AAT TAC GAA GCA TCT GCC AAC 693 
Leu Arg Arg His Glu Val Thr lie Cys Asn Tyr Glu Ala Ser Ala Asn . 
180 185 190 

CCA GCA GAC CAT AGG ... GTC CAT CAG GTT . ACC CCA CAG ACA CAC TTT ATT 741 
Pro Ala Asj> His Arg Val His Gin Val Thr Pro Glh Thr His Phe He 
195 200 2 05 

TCC TAAGGGCTGG CCAAGGCTCC CATAGAGGCG CTGTGTCAGT GAAGATGTAC "794 
Ser 

210 . >.... 

GACTACCTGT TGGGAAGGAC ^AAGGGATGA GGCTCCAGAG AGAGTTGGCT GCCACAGCTC 854 

TGCCAAGCTT TGTCTlTGG<3 GCTTGCTG^A GAAACCTGGC CTACGGAAGA TACGACACCA 914 

CTGGGAGGGT TGTGTAGGTG ^CCAGGrGGACC -ATCGTGGTTC TCTAGGGCGC TGTGGAAATT 974 

GGGTCTTGGG CTGGGTGGCA' TCTGGCAGTC ATGGGTAAGA CTTGCTTTTC CAGTTAATGT 1034 

GGCCATGTGA TTCCAAGTGT CATCSTTdCTT TGTGGAAGAT TGTTGTGTGA CTTPGTTTTrt 1094 

TGATTTTGTA TTTGTTTTTT TAAAGGAAAC TATTTGTGCSG CTATAGjGAAA CT7TTCTGATC 1154 

CCTCCGGATT GTGTTAGTAG TAGCCA^CAG GAGGGTCTCC. AACTAAAACA CTTGTTCCTG 1214 



CTTGCTCCTT : ^GCCC^CTdA TT^TtCAdCA TTCTTGTCAA GT^CCCAQS TTGGAGTTGT " "* 1274 

CTGTCACGCA CATGl'GTCCT GTGGTTATAG CTAGAAGGAC AGGAGTCTCC TGCTGATGCG ,13,34 

TGATAGCTTA AGCTTC(SC3GA GAAGGTCTTT TCCACTGCCT AGCTAAGCAG TCTCGGGAGA 1394 

GCMGGGG^^C?^ TAAGATTGAG5 ACTTA^tTAA 1454 

GATTCCCCTT GGAAATTCCT TAATGTTTAT TAGCTTCTAA CTAGTGTTGT AAGTCCGATG, 1514 

CCAGAATTTG GAGATTTGAG T T CT TCTTT T CATGGCTTTT ATTCACTGTG ACTAATAACSC 1574 

TTCCTAATAA ATCCTTGCCA GACTT ^ ;j ; . * 5 ? 9 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 210 amino acids 

(B) TYPE..: amino acid 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: .' protein ' 

(Xi) SEQUENCE DESCRIPTION: SEQ lb NO: 30: 



Met Ser His Gly His Ser His Gl y Met Gly Asp cys Arg Cys Ala Ala 

5 J- . • 10 ' 15 

Clu Arg Glu Clu Pro Pro Glu cm Hi, Ala Met Ala Thr Leu Tyr Leu 

25 30 
Arg lie Asp Leu Glu Arg Leu Gin Cys Leu Asn Glu Ser Arg Glu Gly 

40 45 

Ser Gly Arg Gly Val Phe Lys Pro Trp Glu Glu Arg Thr Asp Arg ser 

55 60 ■ 

Ts ^ Val G1U Ser •*£ Ala As *> G1 « oiu^ ^ Phe ^ Ile Pro 

80 

Phe Thr Gly Asn Val Lys Leu Lys Gly He lie He Met Gly Glu Asp 

90 35 
A*P Asp ser His Pro Ser Glu Met Arg. Leu Tyr Lys Asn He. Pro Gin 

X0.S Ho 

«et Ser phe Asp Asp Thr Glu Arg Glu Pro Asp Gin Thr Phe Ser Leu 

y - ' ■ V"'- .'- 120 .\>,„ . 125 . ..... 

" Asn Arg Asp Leu Thr Gly Glu Leu Glu Tyr Ala Tur Lys lie Ser Arg 

■•■ 135 - ■ : -i4o : ■ ■ :H;r- 

^5 Val Tyr His Leu Ser lie His lie Ser Lys Asn Phe Gly 

Ala Asp Thr Thr Lys val phe Tyr He Gly, Leu Arg Gly Glu Trp Thr 
^ ^'^S His vai Cys Asn Tyr W Ala Ser Ala 



185 



JVsn Pro Ala 
195 



190 



,. ^? H ^ Hi * ^ .-^i>?^^^^.«l# Phe 



200 



205 



lie Ser *f> f-;: ■./--V : ;>-- , vVv-.:^-.:^-..,-*. 

.... 2x6t ••..-=.■. .... : 7" '., v; '.' 7 : " 

INFORMATION FOR . SEQ ID NO: 31: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear,: . ;?V^v'- ♦*;>;•.' ' -V^ 



(Xi) SEQUENCE DESCRIPTION : • SEQ ip NOi 31 : ; 



CTGGCCTACG GAAGATACGA CAC . " - ^23 



(2) INFORMATION FOR SEQ ID NO: 32: ^ T i S . \ 

? N U) SEQUENCE CIIARACTERISTICS: • , , v v. : r . - •'' 
(A) LENGTH : 23 base pairs 

y (L> TYPE; nucleic acid •• f r -c. ' 
^---•^'M^ "'•{cV'stRANDeDNfeSS: single '* 

(D) TOPOLOGY: linear " : ; ; !r- ; v.: ?: ^' ''V''"--"' 1 " - ' V 

. **JxiK S EQUENCE DESCRI PTTION : l SEQ it). iNO ; ' 3 2 : ' V ' : 

ACAATCCGGA GGCATCAGAA ACT 23 

(2) INFORMATION FOR SEQ ID NO: 33 : . ■££?"- -C 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs t 

(B) TYPE : : nucleic 1 acid '*• 

(C) STRANDEDNESS: single . . • ' 

(D) TOPQLG^Y: linear / 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



AGCCCCGGCC TCCTCGTCCT C ... . 21 



(2) INFORMATION FOR SEQ ID NO : 34 i V < 

(i) SEQUENCE CHARACTERISTICS: ( 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single * ^ 
V v <b) TOPOticteYr linear 

(X&) Sfi0tfe(^ DE^CRiPTlbNs SEQ ID NO: 34: 



GGCGGC^SCA GCGGlrfCTC 19 

.■ ; • ,- r - . - . ,0 . -: < .---^' 4HH>&t ^S'V" - *<" ^ 'I /'^ 1 . : * 
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CLABVIS: 

1. Ame ^od for identifying markersforadise ^ e . c . . 

5 to exhibit said disease state and a second set of mRNAs from one or more subjects kno 

normal subjects; ^~« V ' aM ^'^^*c ra n 
b) amplifying both sets of mRNAs to , 

2 - The method of claim 1 further ^ 

3- The method of cJaim o i_ 

amplificatkm step '""'^ " C< "" binMi<> " "fix** -TO used as j, fte 

5 - The method of claim 1 wherein tK» ^- 



The method of claim 5 wherpm a„ j ; 

a™ wherein the disease state is metastatic j 

: 84316 is metastatic breast cancer. 



: prostate cancer. 

7- The method of claim 5, wherein the disease/ 
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The method of claim 1, wherein said subjects are laboratory animals. 
The method of claim 1 , wherein said subjects are humans. . 

; A method of detecting a metastatic cancer disfease state in a subject, comprising the steps 

a) detecting the quantity of expression of a metastatic cancer disease marker 
fed in peripheral blood of said subject; and 

b) comparing the quantity of expression of said marker in peripheral blood of said 
" subject to the quantity of said marker expressed in peripheral blood of one or more normal 

subjects; 

wherein a difference in quantity of expression of said marker in peripheral blood of said subject 
relative to quantity off expression of said marker in peripheral blood of said one or more normal 
1 5 individuals is indicative of a metastatic cancer disease state. 

11 i The method of claim TO, whereiii said disease marker is an inRNA. 

12. The method of claim II , wherein said niRNA is arhplified by an RKA polymerase 
20 reaction, "-^.j ; " ■ 

T3;X ^^eineilidd of claim 1 1^ Wherein is amplified by reverse transcriptase 

25 14. The^thfcd of claim 10, whfefein iaid detecting is by KNf A fittgeiprinting, branched DNA 
or nuclease protection assay. 

15. The method of claim 10, wherein said metastatic cancer disease state is metastatic 1 
prostate cancer. 

30 
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el, ^ ^ of c,aim ,0 - " ta *'" u ~ c — - - *~ 



17. The memod of claim in which said ^Aoo^^o.^^^^^ 

— **~» -i" - Geneban* Acc^on munbej^ ' 
T030.3, M28130, Y00787, SEQ ID NO: I . SliQ ID NO:2, SKQ ID NO - " s ^'' ' 

NO : 4.sr:oi|>NO:5„ r SKQIDNO:29. V "OJ.SEQID 

18. iW-Uil.,** ; 10 „ which said^ Misa ^ ofAetot ^ 8geije 

20. Hk- method of data 1 1, further defined as convristag the steps of 

lmirkcr ;' —vcly amplify „, teas, . ^ „ _ ^ ^ 

amplification products; 

c) detecting said nucleic acid amplification products; and 

<0 measuring the amount of said nucleic acid a mpi ificafion produce fpro^ 

The methodof claim 20 in whichsaidprin^s aro setectcd roiro^ce anamphcon 
havmg a sequence of or complementary to a sequence of at . «n k 

0«n«h»nt a • sequence ol at least a 50 base conttguous segment of 

Genebank Accession numbers D8745I, TO3013, X03558, M28I30, Y00787, SEQ ID NO'l 



22.^ The method of claim 21, wheroin said amplicon is ton about 50 to about 500 bases 



in 



WO 98/24935 PCT/US97/22105 

141 

23. The method of claim 2 1 , wherein said amplicon is from about 1 00 to about 4 1 5 bases in 
length. 

24. The ihethod of claim 1 0, wherein said meta^tib c^cer disease marker is a polypeptide; 

5 ' ■ r ' •• ' •■ ■ : K 

v25. The mietlibd of claim 24, wherein said polypeptide is encoded by a nucleic acid sequence, 
comprising the sequence disclosed herein as Genebank Accession numbers D87451 , T03013, 
X03558^ M28130,! Y(>6787, SEQ ID NO: 1 , SEQ It) NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ 
ID NO:5, or SEQ ID NO:29. 

16 • ; 'V' ' ■ ■ . ■ : - 

26. The method of claim 24, wherein said detection comprises antibody immundreaction with 
said polypeptide. 

27. The method of claim 26, wherein said detection comprises an ELISA, an 

1 5 immunoprecipitation, a radioimmunoassay, an immunohistochemical, Western blotting, dot 
blotting, or FACS analyses. 

28. Thb method of claiift 24; Nvhereiii said polypeptide is encoded by the IL-8 gene. 

20 29. The method of &aim 10 or claim 24, wherein skid marker is a product 6f the IL-8 gene 

and wherein said comparison is between two alternatively spliced forms of an IL-8 gene product 

3CK T^e method af cl^im 24, wherein the quantity of Iit- f 8 polypeptide in peripheral blood is 
measured using an in vitro bioassay that detects at least one It-8 mediated 

- ^31 ; ^^^ethod of cidiiii 29 wfieteiti said nikrkerS eoffipfe 
Y00787, SEQ ID NO:4 and SEQ ID NO:5. 

$2^^ prognosis ot Si^t^diis i3ls^^ ^ndMo^ I^^M&'S^^^isease 
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a) providing a first set of peripheral blood mRNAs from one or more subjects known 
to exhibit said disease state and a second set of peripheral blood mRNAs from one or more 
normal subjects; 

b) amplifying both sets of mRNAs to provide nucleic acid amplification products; 

c) comparing said sets of amplification products; and 

d) identifying those mRNAs that are differentially expressed between normal 
subjects and subjects exhibiting said disease state; 

wherein a difference in quantity of expression of an mRNA is indicative of a disease marker. 

33. The disease marker of claim 33, wherein the disease state is metastatic or organ confined 
cancer, asthma, lupus erythematous, rheumatoid arthritis, multiple sclerosis, myasthenia gravis, 
autoimmune thyroiditis, amyotrophic lateral sclerosis, interstitial cystitis or prostatitis. 



cancer. 



cancer. 



34. The method of claim 32, wherein the disease state is metastatic prostate 

35. The method of claim 32, wherein the disease state is metastatic breast 

36. The method of claim 32, wherein said subjects are laboratory animals... ... 

37. The method of claim 32, wherein said subjects are humans. 

38. A method of detecting prostate cancer in a biological sample, comprising: 

; measuring the levels of IL-8 in combination with at least one prostate disease 

marker in said sample; and 

(b) comparing said levels with corresponding levels obtained from reference 
populations ofnprmal individuals, mdividuaJs with BPH and individuals with prostate cancer: 

39. The method of claim 38 in which said prostate disease marker is selected from a group 
fisting of: total prostate specific antigen (PSA); prostate specific membrane antigen 
(PSMA=Folic Acid Hydrolase); prostate acid phosphatase (PAP); prostatic secretoiy, proteins 
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(PSP94); human kallekrein 2 (HK2); and the ratio of the concentrations' of free and bound forms 
of PSA (f^t PSA). 

40; Th6 method of claim 3 8 in which the biological sample comprises peripheral human 

•' : 5 V '- blood! ' . . . / "■■.r : .- i :.. -; v : 

41 . The method of claim 38 wherein the level of IL-8 in a biological sample is measured 
lising at least orife antibody that binds to at least one IL-8 gene product: ~ 

10 42. The method of claim 4 1 wherein the level of IL-8 gene product bound to antibody is 

j 

measured by ELIS A. 

43. The method of claim 38 wherein the level of IL-8 ii* a biological sample is meastKred 
using at least one oligonucleotide probe that binds to at least one IL-8 messenger RNA (mRNA). 

44; TOe method of ctiaii^ 

3. • * ' • • \ - 

45. The niethodbf claim 43 wherein the level of oligonucleotide probe to to H>8 mRNA 
20 is measured by nuclease protection assay; >: ^ 

-4^1 - Hifc methdd of claim 43 wherein the leVel of oligcMicteotide bouiid to IL^8 mRNA 
■ -: is measured by RT-PCR™. ; ? i 0;^^ 

25 47. The method of claim 43 wherem the level of oti^^^ to IL-8 mRNA 

- is measured by ligase chain rea^^ -■•'-v:---;- ^VvK-^A. 



' V4&;; InheM^fe^d bf^ai^43^ ilteiteiii^ to IL-8 mRNA 

is measured by PCR™. " " ^ ^ - v 



10 
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49. The method of claim 40 wherein the level. of D>8 in a biological sample is measured 
using an in vitro bioassay that detects at least one IL-8 mediated biological process; 

50. The method of claim 44 wherein the level of IL-8 in a biological sample is measured 

5 using at least one molecule that binds to an IL-8 gene product, wherein said molecule is selected 
from a group consisting of: an IL-8 binding protein; and an IL-8 receptor protein. 



: +■ 



51. The method of claim 48 wherein the level of prostate disease marker in a biological 
sample is measured using at least one antibody that binds to at least one prostate disease marker 
protein. 



52. The method of claim 51 wherein the level of prostate disease marker protein bound to 
antibody is measured by ELISA. 



15 53. The method of claim 39 wherein the level of prostate disease marker in a biological 

sample is measured using at least one oligonucleotide probe that binds to at least one prostate 
disease marker messenger RNA (mRNA). 



■ S4 ' method of claim 43 wherein the level of oligonucleotide probe bound to prostate 

20 disease marker mRNA is measured by nuclease protection assay. 

,p>,-j$Sj. Tlie method of claim 43 wherein the level of oligonucleotide probe bound to prostate 
disease marker mRNA is measured by RT-PCR™. , 



25. 



56. The method of claim 43 wherein the level of oligonucleotide probe bound to prostate - < 
disease marker mRNA is measured by ligase chain reaction. 

.57. The metfcodofclaim 43 wherein the level of oligonucleotide probe; bound to prostate 
disease marker nsRNA is measured by PCR™. 
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58. A method of differential ly diagnosing prostate Cancer and benign prostatic hyperplasia, 
comprising the step of measuring the levels of IL-8 in combination with at least one prostate 
disease marker in a biological sample. 

59. The method of claim 58 in which said prostate disease marker is selected from a group 
consisting of: total prostate specific antigen (PSA), prostate specific niembrM6 ajitigen 
(PSK4A==Folic Acid Hydrolase), prostate acid phosphatase (PAP), prostatic secretory proteins 
(PSP 94 ), human kallekrcin 2 ( HK2), and the ratio of the concentrations of free aihd bound forms 
of PSA (f/t PSA). r 

60. The method of claim 59 in which said biological! sample consi^ of peripheral human 
blood, 

61 . A kit for use in detecting a human disease, comprising: 

(a) a pair of primers for amplifying a disease state marker consisting of a nucleic 
acid; and 

(b) containers for each 6f said primers. 

62. A kit according to cimm 6 1 in which the pair of primers is iselebted tb hmplify a nucleic 
acid marker for metastatic human cancer. 




^ & kit ack&dihg to clafitii 62 f« which the jiair o^primei^ is sel^ctfed to amplify at Aucleic 
^cid h&vifig a sdtiubiice comprising at least a50base segment of Gienebanik Accession numbers 
D87451, T03013, X03558, M28130, Y00787, SEQ1#N0:1,S^ 
SEQ ID NO:4, SEQ IDNO:5, or SE$*D NO:29: * \v^^W^ : ; < ; , 



10 



20 
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(b) a pair of primers selected to amplify a nucleic acid sequence comprising SEQ ID 
NO:5 or Genebank Accession # M28130. 

65. A kit for use in diagnosing metastatic cancer in a biological sample, comprising: 

(a) an antibody which binds with high specificity to a polypeptide having an amino 
acid sequence encoded by a nucleic acid sequence comprising Genebank Accession numbers 
D87451, T03013, X03558, M28130, Y00787, SEQ ID NO: I," SEQ ID NO:2, SEQ ID NO:3, 
SEQ ID NO:4, SEQ ID NO:5,or SEQ ID NO:29. 

(b) a container for said antibody. 



66. A kit according to claim 65, further defined as comprising: 

(a) an antibody that binds with high specificity to a soluble IL-8 gene product; 

(b) an antibody that binds with high specificity to a membrane bound IL-8 gene 
product; and 

15 ( c ) a container for each antibody. 

67. A kit according to claim 65, wherein said metastatic cancer is metastatic prostate cancer. 

68. A kit according to claim 65, wherein said metastatic cancer is metastatic breast cancer. 



69. A kit for detecting or differentially diagnosing human prostate cancer, comprising: 
M at least one detection agent for measuring the levels of IL-8 ma biological sample; 



at least one detection agent for measuring the levels of at least one prostate disease 
, marker in said biological sample; and . 
25 (c) containers for each of said detection agents, 

70. The kit of claim 69 in which said prostate disease marker is, selected, from a grpup 
consisting of: total prostate specific antigen (PSA), prostate specific membrane antigen 
(PSMA=Folic Acid Hydrolase), prostate acid phosphatase (PAP), prostatic secretory proteins 



10 
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(PSP94), human kallekrein 2 (HK2), and the ratio of the concentrations of free and bound forms 
ofPSA(fftPSA). 

71 . The kit of claim 70 in which said detection agents are selected from a group consisting 
of: polyclonal antibodies; monoclonal antibodies; oligonucleotides; paired oligonucleotides 
designed to bind to opposite strands of a double-stranded DNA molecule; and at least one 
molecule that binds to an IL-8 gene product. 

72. The method of claim 16 in which said breast cancer marker is selected from a group 
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Figure 2 

Ability of Total PSA (ng/ra 1) to Distinguish BPH and 
Stages A, B, & C Prostate Cancer (n = 142) 

Area Under the Curve: 0.5995 
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Figure 3 
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Stages A, B, & C Prostate Cancer (n = 142) 
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Figure 5 

Ability of UC325 (pg/m 1) & T-PSA (ng/m 1) to Distinguish 
B PH and Stages A, B, & C P r 6 state Cancer (n = 142) 
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Figure 6 

A bility of U C325 (pg/m 1) & fit PSA Ratio to D istinguish 
B P H and Stages A , B , & C P rostate C ancer (n = 142) 
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